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RNA dependant RNA Polmerases 



Telomerases 



ms DNAs 



Sc-Est2p 
Ea-p123 
Sp-Trt1p 
hTRT 



Mito. plasmid/RTL 



Group II Introns 



Non-LTR Retrotransposons 



Hepadnaviruses 



LTR Retrotransposons (Copia-Ty1 Subgroup) 



LTR Retrotransposons (Gypsy-Ty3 Subgroup) 
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Retroviruses 



FIG. 6 
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lamda Gphi5 
44022bp 



NC01 19329 
NC01 19932 
INOT1 20063 
BALI 20098 



NC01 29459 
NC01 30730 

SAL1 34841 
NOT1 34874 

NC01 39768 



Left Arm 



pGRN144 
8248 BP 

XH01 3580 
ECOR5 3587 
CLA1 3601 
BGL2 3615 
NC01 3636/ 
TTHIII1 3741 

AFL2 4743 
ESP1 4817 



hTRT5'- 



Right Arm 



EAM 11041 1009 
SCA1 1492 
XMN1 1609 
NDE1 218 
SPLA 2202I 
SNAB1 2220 




HRJW 



HPA1 4874 

AVR2 5699 
AGE1 5915 
EC047III 6078 

MLU1 7020 

SPH1 7746 
\SAL1 7980 
\BAMH1 7992 
KPN1 8001 
ASP718 8001 
SAC1 8007 
ECL136II 8007 
ECOR1 8013 



0- 



ORI <-AmpR CatS-> 

<-Ampsig <-Flori 
<-Amp_pro 



alu Topoll->Sp1 



CAP-> 
CCAAT-> 
CAP-> 
CAP-> . 
CAP-> 



E2F 
E2F 
Sp1 
Sp1 
NFkB 
hTRT5'-> 

E2F" 

lntron1-> 
NFkB 
NFkB 
NFkB 



lntron2-> 
<-betagalRBS 
<-LACpro 



FIG.. 7 
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© 



© 



© 



A = pGRN144 
B = pSEAP2-Basic 
C = BGL2-EC047III fragment from A 
into BGL2-NRU1 sites of B 



PGRN144 
7904 bp 



BGL2 3615 



ORI 



<-AmpR Cat S-> 
<-Ampsig <— Flori 
<-Amppro forward-> 



7 77? 



EC047III 5856 

EC047III 7163 



■fr 



I 

hTRT5' -> <-reverse 
I ntro n 1 -> <-LAC p ro 
<-betagalRBS 



pSEAP2-Basic 
4677bp 




TB-> <-SV40polyA(E) <-AmpR 
Signal-> ORI <-Ampsig 

-SEAP — > <-Amppro 
SV40polyA(L)-> Florh-> 



FIG. 9 
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181 GGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGC 
CCTGGGCCGCCGAAAGGCGCGCGACCACCGGGTCACGGACCACACGCACGGGACCCTGCG 



NFkB_CSl 
GGGRQTYYQC 
NFkB-MHC-1.2 
TGGGCTTCCCC 

241 ACGGCCGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCG 
TGCCGGCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGC 

Intronl 

3 01 GCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTC 
CGACCCCAACTCCCGCCGGCCCCCCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAG 

NFkB_CSl 
GGGRQTYYQC 
NFkB_CS2 
RGGGRMTYYCC 

Topo_II_cleavage_site 
RNYNNCNNGYNGKTNYNY 
★ ★*★★****★★★*•*★*★> 

3 61 AGGGCGCTTCCCCCGCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGG 
TCCCGCGAAGGGGGCGTCCACAGGACGGACTTCCTCGACCACCGGGCTCACGACGTCTCC 

FIG. 12 
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1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT 

51 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA 

101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AAT CATGGCA TTCACTCAGC 

151 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA 

2 01 TCCAGAAAGT TAT T AGATG A AGAAAT CAAT CTCAAAGTCA TTATAAAGAT 
251 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG 

3 01 ■ AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT 

3 51 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT 

4 01 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTT CAATGAT TTGGATTTCA 

4 51 ACTTAAGGGA AAT CAATT AG CAAAGACCCA TTTATTAACA GCTCTTTCAA 

5 01 CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAAC CAAGT TAGAG CAATG 
551 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT T AAT AT T C CA 

6 01 GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG 
6 51 ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 
701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA 
751 GAATGAGAAA GATCACTTTC TCAACAACAT CAACGTGCCG AATTGGAATA 
8 01 ATATGAAATC AAGAAC CAGA ATATTTTATT GCACT CATTT TAATAGAAAT 
851 AAC CAATT CT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 
901 AGCGATGGAC AGAGCTCAGA CGATATTCAC G AAT AT AT T C AGATTTAATA 
951 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG 

10 01 CTTGAGAAAG TCAAAGATTT TAACTTCAAC TACTATTTAA CAAAAT CTTG 

1051 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 

1101 TAAATAAAAC TAGAGAAGAA AAGT CGAAGT ACTATGAAGA GCTGTTTAGC 

1151 TACACAACTG ATAATAAATG CGTCACACAA TTTATTAATG AATTTTTCTA 

12 01 CAATATACTC CCCAAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC 
1251 AAAAGAAAGT TAAGAAATAT GTGGAACTAA AC AAG CATGA ACTCATTCAC 

13 01 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA 

13 51 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT 

14 01 ACGTCTTATG GAAATTGCTC CGATGGATAT TCGAGGAT CT CGTCGTCTCG 
1451 CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA 
1501 AACCTATTAC TACAGAAAGA ATATTTGGGA CGT CATTATG AAAATGTCAA 
1551 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 
16 01 GAAGAATGGA AAAAGTCGCT TGGATTTGCA CCTGGAAAAC TCAGACTAAT 

16 51 ACCGAAGAAA ACTACTTTCC GTC CAATT AT GACTTTCAAT AAGAAGATTG 

17 01 T AAATT CAGA CCGGAAGACT A CAAAAT TAA CTACAAATAC GAAGTTATTG 
1751 AACTCTCACT TAATGCTTAA GA C ATTGAAG AATAGAATGT TTAAAGATCC 

18 01 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG 
1851 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 
1901 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 
1951 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA 
2 001 CTGCACAAAT T CT AAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC 
2 051 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 
2101 TGCACTTGAA GGAGGACAAT AT CC AAC CTT ATT CAGTGTT CTTGAAAATG 
2151 AACAAAATGA CTT AAATG CA AAGAAAACAT TAATTGTTGA AGCAAAGCAA 

22 01 AGAAATTATT TTAAGAAAGA TAACTTACTT CAACCAGTCA TTAATATTTG 
2 251 C CAAT ATAAT TACATTAACT TTAATGGGAA GTTTTATAAA CAAACAAAAG 

23 01 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 
23 51 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AAT CAATG AA 



FIG. 13A 



+ 
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24 01 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 

24 51 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 

2 5 01 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TT CAATATGA AGAAACTACA 

2 5 51 GACTAGTTTT CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA 

2 6 01 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGG CAT C 

26 51 TCAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT 

2 7 01 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAG AAAG CAT 

2751 CAATGTGGCT CAAGAAGAAA CTAAAGT CGT TTTTAATGAA TAACATTACC 

28 01 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT 

2 8 51 CAACAAGTTA TTTATAT CAG GCGGTTACAA ATACATGCAA TGAGCCAAAG 

2 9 01 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC 

2 9 51 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA 

3 0 01 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG 
3 0 51 ACTTTTTCCT TAGCACACTG- AAGCACTTTA TTGAAATATT CAGCACAAAA 
3101 AAGTACATTT TCAACAGAGT" TTGCATGATC CTCAAGGCAA AAGAAGCAAA 
3151 GCTAAAAAGT GACCAATGTC AATCTCTAAT TCAATATGAT GCATAGTCGA 
32 01 CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG TCTTATATAC 
3 2 51 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 

FIG. 13B 



1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RNQSQSHYKD 

51 LEDIKIFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL 

101 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 

151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 

2 01 ADMNE PRCCS TCKYNVKNEK DHFLNNINVP NWNKMKSRTR I FYCTHFNRN 
251 NQFFKKHEFV SNKNNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKIAYM 

3 01 LEKVKDFNFN YYLTKSCPLP ENWRERKQKI ENLINKTREE KSKYYEELFS 

3 51 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELIH 

4 01 KNLLLEKINT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWIFEDLWS 
451 LIRCFFYVTE QQKSYSKTYY YRKNIWDVIM KMS IADLKKE TLxAEVQEKEV 
501 EEWKKS LGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 
551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQ PKLFFA 
6 01 TMDIEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNNIVIDSKN 
651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTLIVEAKQ 
701 RNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 
751 ATLEESSLGF LRDESMNPEN PNVNLL.MRLT DDYLLITTQE NNAVLFIEKL 
8 01 INVSRENGFK FNMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI 
851 S I DMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 
901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 
951 LEVSKIIYSV TRAF FKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK 

1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A 
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LninLnLnLnininininm^* 

004J4J 01 xJ «J U CT> 4-> U 
CDaJaJxJfTJjJOitjumnj 

CnCnoucncnxJ<daSrdrd 
xJxJCnfdxJrdraurdCnaJ 

XJ 01 XJ XJ XJ XJ u u d u U 

DlOirfl-uJ-UtCCnUOU rd 

xJCnramrdUaJCnrdxJxJ 
xJUfdxJtjiaJiJaJdJfTJCn 

xj m en cn o iJ u oiuioxu 

rdrdrdcnrdrdxjrdcntjxj 
U u rd rd Ol Cn Cn J-> Cn CH xj 
U 4J u CT> xJ xj rd xJ a3 rd D> 

mrdajfdtnxjxjucncjrd 

o cn xj cn xj xj 01 cn xj xj m 
xj xj xj cn xj a) o3 jj O aj 4J 

4JODln3a3DlrtnJnJ CJ)4J 
aJcdcncnrtrdfljxJOrdU 
nj O xJ O xj rd Cn xj 4-» Cn Cn 
xJOcnuuxJxJUainJrd 
xJOrdxJCnucnnjxJCnxJ 
nj Oi (d rd Cn nj cn nj xj cn cn 
njAJtOjJtiJflOiDlOUU 
mocnaJCJcdxJajcncnxJ 
njrccnococnfdnjrGxJO 

tn U n3 aJ 4J DijJ aj tnu 
rdrdxJrdrdrd4-»OCnrdO 
rd Cn as xj cn rd xj xj cn u m 
as xj rd xj <d cn -u aij-)j-) cn 
xJOCnrdUOxJfdaJrdO 
Cn cn cd cn cn rd rd rd ro rd xj 
<tJ u jj cn xj cn rd cnaj cn u 
xj xj xj cn o cts u cn cn xj u 
ucnxjcnxjcnxjucnrdcn 
4-> com cn xj xj xj cn rd cn u 
iJUUdluuun! cn xj aj 
xj cn xj rd cn rd aj oj xj xj cn 
nj ti u.ai u oi ni <d o jj 
CnuxJrdrdOxJOCnxJU 
cnajoajocnoxjuum 
(CCnunJflJjJUUUiJiJ 

rd XJ U CJ CD U U U UCJU 

xj re xj xj cn u xj xj u m xj 
u cn rd xj xj cn xj cn xj cn cn 
A «j aj u x) cn xj Cn xj aj cn 
xJ 05 x) xj x) o x) xi u Cn xj 
fO (C u (i3 <tj u cn xj xj cn to 
u cn cn xj m cn xj u ra cn xj 
cn as cn xj xj cn xj xj ro xj cn 
rdrdrdrdajxjxjxj cn xj xj 
rd xj cn cn cn cn xj xj cn rd rd 
u xj xj m cn cn cn ai m u rd 

CnxJrdrdrdxJCnrdCnxJrd 
rdxJUCnrdrdrordCnrdCn 
fTJOrcuuuasurCxJiJ 
(T5trlxJrdCnUJ-»umcdU 
UxJrdCnrdxJrdCnrdrdxJ 
UaJfdrdaJrdxjrdxJaSxJ 
UJ-JJ-JrCfdxJrdxJxJfdxJ 

u xj cn rd cn xj xj m xj cn m 
uordrdfTJCnxJronjcdxJ 
murdmajcnunjcncdu 
Cn xj m U cn rd xj cn xj cd xj 
xJrdrdCnxJOCnuomxJ 
UxJxJxJxJUxJUUrnu 
CnxJxJrdcnordxJxJfdrd 
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1 gcagcgctgc gtcctgctgc gcacgtggga 
61 gcgcgctccc cgctgccgag ccgtgcgctc 
121 gccgctggcc acgttcgtgc ggcgcctggg 
181 ggacccggcg gctttccgcg cgctggtggc 
241 acggccgccc cccgccgccc cctccttccg 
301 ccgagtgctg cagaggctgt gcgagcgcgg 
3 61 gctgctggac ggggcccgcg ggggcccccc 
421 cctgcccaac acggtgaccg acgcactgcg 
481 ccgcgtgggc gacgacgtgc tggttcacct 
541 ggctcccagc tgcgcctacc aggtgtgcgg 
601 tcaggcccgg cccccgccac' acgctagtgg 
661 ctggaaccat agcgtcaggg aggccggggt 
721 gaggcgcggg ggcagtgcca gccgaagtct 
781 tgcccctgag ccggagcgga cgcccgttgg 
841 gcgtggaccg agtgaccgtg gtttctgtgt 
901 cacctctttg gagggtgcgc tctctggcac 
961 gcaccacgcg ggccccccat ccacatcgcg 
1021 cccggtgtac gccgagacca agcacttcct 
1081 gccctccttc ctactcagct ctctgaggcc 
1141 gaccatcttt ctgggttcca ggccctggat 
1201 gccccagcgc tactggcaaa tgcggcccct 
1261 gtgcccctac ggggtgctcc tcaagacgca 
1321 agccggtgtc tgtgcccggg agaagcccca 
1381 cacagacccc cgtcgcctgg tgcagctgct 
1441 cggcttcgtg cgggcctgcc tgcgccggct 
1501 caacgaacgc cgcttcctca ggaacaccaa 
1561 gctctcgctg caggagctga cgtggaagat 
1621 gagcccaggg gttggctgtg ttccggccgc 
1681 caagttcctg cactggctga tgagtgtgta 
1741 tgtcacggag accacgtttc aaaagaacag 
1801 caagttgcaa agcattggaa tcagacagca 
18 61 ggaagcagag gtcaggcagc atcgggaagc 
1921 cttcatcccc aagcctgacg ggctgcggcc 
1981 cagaacgttc cgcagagaaa agagggccga 
2041 cagcgtgctc aactacgagc gggcgcggcg 
2101 cctggacgat atccacaggg cctggcgcac 
2161 gccgcctgag ctgtactttg tcaaggtgga 
2221 ggacaggctc acggaggtca tcgccagcat 
2281 tcggtatgcc gtggtccaga aggccgccca 
2341 cgtctctacc ttgacagacc tccagccgta 
2401 gaccagcccg ctgagggatg ccgtcgtcat 
2461 cagtggcctc ttcgacgtct tcctacgctt 
2521 caagtcctac gtccagtgcc aggggatccc 
2581 cagcctgtgc tacggcgaca tggagaacaa 
2641 gctcctgcgt ttggtggatg atttcttgtt 
2701 cttcctcagg accctggtcc gaggtgtccc 
2761 gacagtggtg aacttccctg tagaagacga 
2821 gccggcccac ggcctattcc cctggtgcgg 
2881 gcagagcgac tactccagct atgcccggac 
2941 cggcttcaag gctgggagga acatgcgtcg 
3001 tcacagcctg tttctggatt tgcaggtgaa 
3061 caagatcctc ctgctgcagg cgtacaggtt 
3121 tcagcaagtt. tggaagaacc ccacattttt 
3181 ctgctactcc atcctgaaag ccaagaacgc 
3241 cggccctctg ccctccgagg ccgtgcagtg 
3301 gactcgacac cgtgtcacct acgtgccact 
3361 gctgagtcgg aagctcccgg ggacgacgct 
3421 actgccctca gacttcaaga ccatcctgga 
3481 gagcagacac cagcagccct gtcacgccgg 
3541 cacacccagg cccgcaccgc tgggagtctg 
3601 catgtccggc tgaaggctga gtgtccggct 
3661 gagtgtccag cacacctgcc gtcttcactt 
3721 gggccagctt ttcctcacca ggagcccggc 
3781 ccagattcgc cattgttcac ccctcgccct 
3 841 aggtggagac cctgagaagg accctgggag 
3901 ccctgtacac aggcgaggac cctgcacctg 
3 961 gaggtgctgt gggagtaaaa tactgaatat 



agccctggcc ccggccaccc ccgcgatgcc 
cctgctgcgc agccactacc gcgaggtgct 
gccccagggc , tggcggctgg tgcagcgcgg 
ccagtgcctg gtgtgcgtgc cctgggacgc 
ccaggtgtcc tgcctgaagg agctggtggc 
cgcgaagaac gtgctggcct tcggcttcgc 
cgaggccttc accaccagcg tgcgcagcta 
ggggagcggg gcgtgggggc tgctgctgcg 
gctggcacgc tgcgcgctct ttgtgctggt 
gccgccgctg taccagctcg gcgctgccac 
accccgaagg cgtctgggat gcgaacgggc 
ccccctgggc ctgccagccc cgggtgcgag 
gccgttgccc aagaggccca ggcgtggcgc 
gcaggggtcc tgggcccacc cgggcaggac 
ggtgtcacct gccagacccg ccgaagaagc 
gcgccactcc cacccatccg tgggccgcca 
gccaccacgt ccctgggaca cgccttgtcc 
ctactcctca ggcgacaagg agcagctgcg 
cagcctgact ggcgctcgga ggctcgtgga 
gccagggact ccccgcaggt tgccccgcct 
gtttctggag ctgcttggga accacgcgca 
ctgcccgctg cga'gctgcgg tcaccccagc 
gggctctgtg gcggcccccg aggaggagga 
ccgccagcac agcagcccct ggcaggtgta 
ggtgccccca ggcctctggg gctccaggca 
gaagttcatc tccctgggga agcatgccaa 
gagcgtgcgg gactgcgctt ggctgcgcag 
agagcaccgt ctgcgtgagg agatcctggc 
cgtcgtcgag ctgctcaggt ctttctttta 
gctctttttc taccggaaga gtgtctggag 
cttgaagagg gtgcagctgc gggagctgtc 
caggcccgcc ctgctgacgt ccagactccg 
gattgtgaac atggactacg tcgtgggagc 
gcgtctcacc tcgagggtga aggcactgtt 
ccccggcctc ctgggcgcct ctgtgctggg 
cttcgtgctg cgtgtgcggg cccaggaccc 
tgtgacgggc gcgtacgaca ccatccccca 
catcaaaccc cagaacacgt actgcgtgcg 
tgggcacgtc cgcaaggcct tcaagagcca 
catgcgacag ttcgtggctc acctgcagga 
cgagcagagc tcctccctga atgaggccag 
catgtgccac cacgccgtgc gcatcagggg 
gcagggctcc atcctctcca cgctgctctg 
gctgtttgcg gggattcggc gggacgggct 
-99tgacacct cacctcaccc acgcgaaaac 
tgagtatggc tgcgtggtga acttgcggaa 
ggccctgggt ggcacggctt ttgttcagat 
cctgctgctg gatacccgga ccctggaggt 
ctccatcaga gccagtctca ccttcaaccg 
caaactcttt ggggtcttgc ggctgaagtg 
cagcctccag acggtgtgca ccaacatcta 
tcacgcatgt gtgctgcagc tcccatttca 
cctgcgcgtc atctctgaca cggcctccct 
agggatgtcg ctgggggcca agggcgccgc 
gctgtgccac caagcattcc tgctcaagct 
cctggggtca ctcaggacag cccagacgca 
gactgccctg gaggccgcag ccaacccggc 
ctgatggcca cccgcccaca gccaggccga 
gctctacgtc ccagggaggg aggggcggcc 
aggcctgagt gagtgtttgg ccgaggcctg 
gaggcctgag cgagtgtcca gccaagggct 
ccccacaggc tggcgctcgg ctccacccca 
ttccactccc cacataggaa tagtccatcc 
gccctccttt gccttccacc cccaccatcc 
ctctgggaat ttggagtgac caaaggtgtg 
gatgggggtc cctgtgggtc aaattggggg 
atgagttttt cagttttgaa aaaaa 
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MPRAPRCRAVRSLLRSHYREVLPLATFWRLGPQGWRLVQRGDP 



AAFRALVAQCLVCVPWDARPPPAAPSFRQVSCLKELVARVLQRL 
CERGAKNVLAFGFALLDGARGGPPEAFTTSVRSYLPNTVTDALR 
GSGAWGLLLRRVGDDVLVHLLARCALFVLVAPSCAYQVCGPPLY 
QLGAATQARPPPHASGPRRRLGCERAWNHSVREAGVPLGLPAPG 
ARRRGGS ASRS L PLP KR PRRGAAPE PERTP VGQGS WAH PGRTRG 
PSDRGFCWS PARPAEEATSLEGALSGTRHSHPSVGRQHHAGPP 
STSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRP 
SLTGARRLVETIFLGSRPWMPGTPRRLPRIiPQRYWQMRPLFLEL 
LGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEE 
EDTDPRRLVQLLRQHSS PWQVYGFVRACLRRLVPPGLWGSRHNE 
RRFLRNTKKF I SLGKHAKLS LQELTWKMS VRDCAWLRRS PGVGC 
VPAAEHRLREE I LAKFLHWLMS VYWE LLRS FF YVTETTFQKNR 
h F F YRKS VW S KLQ S I G I RQHL KRVQLRE L S E AE VRQHRE AR PAL 
LTSRLRF I PKPDGLRP I VNMDYWGARTFRREKRAERLTSRVKA 
LFS VLNYERARRPGLLGAS VLGLDD I HRAWRTFVLRVRAQD P P P 
ELYFVKVDVTGAYDTI PQDRLTEVI AS 1 1 KPQNTYCVRRYAWQ 
KAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAWI 
EQSSS LNEASSGL FDVFLRFMCHHAVR I RGKS YVQCQG I PQGS I 
LSTLLC S LC YGDMENKL FAG I RRDGLLLRL VDD FLLVTPHLTHA 
KTFLRTLVRGVPEYGCVWLRKTWNFPVEDEALGGTAFVQMPA 
HGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGR 
NMRRKLFGVXjRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRF 
HACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSL 
GAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQ 
TQLSRKLPGTTLTALEAAANPALPSDFKTILD 

FIG. 17 

GGCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTT 
TTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTG 
GAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCT 
GTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACT 
CCGCTTCATCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGG 
AGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACT 
GTTCAGCGTGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCT 
GGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGA 
CCCGCCGCCTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCC 
C CAGGACAGGCTCACGGAGGT C AT CGC CAGCATCATCAAACCC CAGAACACGTACTGCGT 
GCGTCGGTATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAG 
CCACGTCCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCT 
GCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGC 
TGCTCCTGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAA 
CCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGA 
AGACAGTGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGA 
TGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGG 
TGCAGAGCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACC 
GCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGT 
GTCACAGCCTGTTTCTGGAXTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCT 
ACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTC 
ATCAGCAAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCC 
TCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCG 
CCGGCC7TCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGC 
TGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGC 
AGCTGAGTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGG 
CACTGCCCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCG 
AGAGCAGACACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGC 
CCACACCCAGGCCTGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCT 
GCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGC 
TGAGTGTCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCC 
AGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATC 
CCCAGATTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATC 
CAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGT 
, GCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGG 
GGAGGTGCTGTGGGAGTAAAAT ACTGAATATATGAGTTTTTCAGTTTTG 0 AAAAAAAAAA 
AAAAAAAAAAAAAAAA 
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MetSerValTyrValValGluLeuLeuArgSerPhePhe 
TyrValThrGluThrThrPheGlnLysAsnArgLeuPhe 
PheTyrArgLysSerValTrpSerLysLeuGlnSerlle 
GlylleArgGlnHisLeuLysArgValGlnLeuArgGlu 
LeuSerGluAlaGluValArgGlnHisArgGluAlaArg 
ProAlaLeuLeuThrSerArgLeuArgPhelleProLys 
ProAspGlyLeuArgProIleValAsnMetAspTyrVal 
ValGlyAlaArgThrPheArgArgGluLysArgAlaGlu 
ArgLeuThrSerArgValLysAlaLeuPheSerValLeu 
AsnTyrGluArgAlaArgArgProGlyLeuLeuGlyAla 
S erVa 1 LeuGlyLeuAspAsp 1 1 eHi s ArgAl aTrpArg 
ThrPheValLeuArgValArgAlaGlnAspProProPro 
GluLeuTyrPheValLysValAspValThrGlyAlaTyr 
AspThrlleProGlnAspArgLeuThrGluVallleAla 
SerllelleLysProGlnAsnThrTyrCysValArgArg 
TyrAlaValValGlnLysAlaAlaHisGlyHisValArg 
LysAlaPheLysSerHisValLeuArgProValProGly 
AspProAlaGlyLeuHisProLeuHisAlaAlaLeuGln 
ProValLeuArgArgHisGlyGluGlnAlaValCysGly 
AspSerAlaGlyArgAlaAlaProAlaPheGlyGly 

FIG. 19 



i 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 



his tyr arg 
CAC TAC CGC 



gly pro gin 
GGG CCC CAG 



phe arg ala 
TTC CGC GCG 



ala arg pro 
GCA CGG CCG 



leu lys glu 
CTG AAG GAG 



20 

glu val leu 
GAG GTG CTG 



gly trp arg 
GGC TGG CGG 

50 

leu val ala 
CTG GTG GCC 



pro pro ala 
CCC CCC GCC 

80 

leu val ala 
CTG GTG GCC 



pro leu ala 
CCG CTG GCC 

4 0 

leu val gin 
CTG GTG CAG 



gin cys leu 
CAG TGC CTG 

70 

ala pro ser 
GCC CCC TCC 



arg val leu 
CGA GTG CTG 



thr phe val 
ACG TTC GTG 



arg gly asp 
CGC GGG GAC 



val cys val 
GTG TGC GTG 



phe arg gin 
TTC CGC CAG 



gin arg leu 
CAG AGG CTG 



30 

arg arg leu 
CGG CGC CTG 



pro ala . ala 
CCG GCG GCT 

60 

pro trp asp 
CCC TGG GAC 



val ser cys 
GTG TCC TGC 

90 

cys glu arg 
TGC GAG CGC 



100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 
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130 

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 
TAG CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 

140 150 
trp gly leu leu leu arg arg val gly asp asp val leu val his 
TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC 

160 

leu leu ala arg cys ala leu phe val leu val ala pro ser cys 
CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC 

170 180 
ala tyr gin val cys gly pro pro leu tyr gin leu gly ala ala 
GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC 

190 

thr gin ala arg pro pro pro his ala ser gly pro arg arg arg 
ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT 

200 210 
leu gly cys glu arg ala trp asn his ser val arg glu ala gly 
CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG GCC GGG 

220 

val pro leu gly leu pro ala pro gly ala arg arg arg gly gly 
GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG GGC 

230 240 
ser ala ser arg ser leu pro leu pro lys arg pro arg arg gly 
AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC 

250 

ala ala pro glu pro glu arg thr pro val gly gin gly ser trp 
GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG 

260 270 
ala his pro gly arg thr arg gly pro ser asp arg gly phe cys 
GCC CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT 



val val ser pro ala arg pro 
GTG GTG TCA CCT GCC AGA CCC 

290 

gly ala leu ser gly thr arg 
GGT GCG CTC TCT GGC ACG CGC 



gin his his ala gly pro pro 

CAG CAC CAC GCG GGC CCC CCA 

320 

trp asp thr pro cys pro pro 

TGG GAC ACG CCT TGT CCC CCG 



280 

ala glu glu ala thr ser leu glu 
GCC GAA GAA GCC ACC TCT TTG GAG 

300 

his ser his pro ser val gly arg 
CAC TCC CAC CCA TCC GTG GGC CGC 

310 

ser thr ser arg pro pro arg pro 
TCC ACA TCG CGG CCA CCA CGT CCC 

330 

val tyr ala glu thr lys his phe 
GTG TAC GCC GAG ACC AAG CAC TTC 
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leu tyr ser 
CTC TAC'TCC 



leu ser ser 
CTC AGC TCT 



glu thr ile 
GAG ACC ATC 



arg arg leu 
CGC AGG TTG 



leu phe* leu 
CTG TTT CTG 



val leu leu 
GTG CTC CTC 



ala ala gly 
GCA GCC GGT 



ala pro glu 
GCC CCC GAG 



leu arg gin 
CTC CGC CAG 



ala cys leu 
GCC TGC CTG 



•his asn glu 
CAC AAC GAA 



leu gly lys 
CTG GGG AAG 



.met ser val 
ATG AGC GTG 



gly cys val 
GGC TGT GTT 



ser gly asp 
TCA GGC GAC 

350 

leu arg pro 
CTG AGG CCC 



phe leu gly 
TTT CTG GGT 

380 

pro arg leu 
CCC CGC CTG 



glu leu leu 
GAG CTG CTT 

410 

lys thr his 
AAG ACG CAC 



val cys ala 
GTC TGT GCC 

440 

glu glu asp 
GAG GAG GAC 



his ser ser 
CAC AGC AGC 

470 

arg arg leu 
CGC CGG CTG 



arg arg phe 
CGC CGC TTC 

500 

his ala lys 
CAT GCC AAG 



arg asp cys 
CGG GAC TGC 

530 

pro ala ala 
CCG GCC GCA 



340 

lys glu gin 
AAG GAG CAG 



ser leu thr 
AGC CTG ACT 

370 

ser arg pro 
TCC AGG CCC 



pro gin arg 
CCC CAG CGC 

■ 400 
gly asn his 
GGG AAC CAC 



cys pro leu 
TGC CCG CTG 

430 

arg glu lys 
CGG GAG AAG 



thr asp pro 
ACA GAC CCC 

460 

pro trp gin 
CCC TGG CAG 



val pro pro 
GTG CCC CCA 

490 

leu arg asn 
CTC AGG AAC 

leu ser leu 
CTC TCG CTG 

520 

ala trp leu 
GCT TGG CTG 



glu his arg 
GAG CAC CGT 



leu arg pro 
CTG CGG CCC 



gly ala arg 
GGC GCT CGG 



trp met pro 
TGG ATG CCA 



tyr trp gin 
TAC TGG CAA 



ala gin cys 
GCG CAG TGC 



arg ala ala 
CGA GCT GCG 



pro gin gly 
CCC CAG GGC 



arg arg leu 
CGT CGC CTG 



val tyr gly 
GTG TAC GGC 



gly leu trp 
GGC CTC TGG 



thr lys lys 
ACC AAG AAG 



gin glu leu 
CAG GAG CTG 



arg arg ser 
CGC AGG AGC 



leu arg glu 
CTG CGT GAG 



ser phe leu 
TCC TTC CTA 

360 
arg leu val 
AGG CTC GTG 



gly thr pro 
GGG ACT CCC 

390 

met arg pro 
ATG CGG CCC 



pro tyr gly 
CCC TAC GGG 

420 
val thr pro 
GTC ACC CCA 



ser val ala 
TCT GTG GCG 

450 
val gin leu 
GTG CAG CTG 



phe val arg 
TTC GTG CGG 

480 
gly ser arg 
GGC TCC AGG 



phe ile ser 
TTC ATC TCC 

510 
thr trp lys 
ACG TGG AAG 



pro gly val 
CCA GGG GTT 

540 

glu ile leu 
GAG ATC CTG 
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550 

ala lys'phe leu his trp leu met ser val tyr val val glu leu 
GCC AAG TTC CTG CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG 

560 570 
leu arg ser phe phe tyr val thr glu thr thr phe gin lys asn 
CTC AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC 

580 

arg leu phe phe tyr arg lys ser val trp ser lys leu gin ser 
AGG CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC 

590 600 
ile gly ile arg gin his leu lys arg val gin leu arg glu leu 
ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG 

610 

ser glu ala glu val- arg gin his arg glu ala arg pro ala leu 
TCG GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG 

620 630 
leu thr ser arg leu arg phe ile pro lys pro asp gly leu arg 
CTG ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG 

640 

pro ile val asn met asp tyr val val gly ala arg thr phe arg 
CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC 

650 660 
arg glu lys arg ala glu arg leu thr ser arg val lys ala leu 
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 

670 

phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu 
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG 

680 690 
gly ala ser val leu gly leu asp asp ile his arg ala trp arg 
GGC GCC TCT GTG. CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 

700 

•thr phe val leu arg val arg ala gin asp pro pro pro glu leu 
ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG 

710 720 
tyr phe val lys val asp val thr gly ala tyr asp thr ile pro 
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 

730 

gin asp arg leu thr glu val ile ala ser ile ile lys pro gin 
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG 

740 750 
asn thr tyr cys val arg arg tyr ala val val gin lys ala ala 
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC 
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760 

his gly his val arg lys ala phe lys ser his val leu arg pro 
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC CTA CGT CCA 

770 780 
val pro gly asp pro ala gly leu his pro leu his ala ala leu 
GTG CCA GGG GAT CCC GCA GGG CTC CAT CCT CTC CAC GCT GCT CTG 

790 

gin pro val leu arg arg his gly glu gin ala val cys gly asp 
CAG CCT GTG CTA CGG CGA CAT GGA GAA CAA GCT GTT TGC GGG GAT 

800 807 
ser ala gly arg ala ala pro ala phe gly gly OP 

TCG GCG GGA CGG GCT GCT CCT GCG TTT GGT GGA TGA TTTCTTGTTGGT 
GACACCT CAC CT CAC CC ACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGA 
GTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCCTGTAGAAGACGAGGC 
CCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCT 
GCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAGCTATGCCCGGACCTC 
CATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAA 
ACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGATTTGCAGGTGAACAG 
CCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCA 
CGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAACCCCACATTTTTCCT 
GCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGG 
GATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCT 
GTGCCACCAAGCATTCCTGCTCAAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCT 
GGGGTCACTCAGGACAGCCCAGACGCAGCTGAGTCGGAAGCTCCCGGGGACGACGCTGAC 
TGCCCTGGAGGCCGCAGCCAACCCGGCACTGCCCTCAGACTTCAAGACCATCCTGGACTG 
ATGGCCACCCGCCCACAGCCAGGCCGAGAGCAGACACCAGCAGCCCTGTCACGCCGGGCT 
CTACGTCCCAGGGAGGGAGGGGCGGCCCACACCCAGGCCCGCACCGCTGGGAGTCTGAGG 
CCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAG 
GCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCCGTCTTCACTTCCC 
CACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTC 
CACTCCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCCTCGCCCTGCC 
CTCCTTTGCCTTCCACCCCCACCATCCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTC 
TGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGAT 
GGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATG 
AGTTTTTCAGTTTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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1 CCATGGGACCCACTGCAGGGGCAGCTGGGAGGCTGCAGGCTTCAGGTCCCAGTGGGGTTG 
GGTACCCTGGGTGACGTCCCCGTCGACCCTCCGACGTCCGAAGTCCAGGGTCACCCCAAC 

6 1 CCATCTGCCAGTAGAAACCTGATGTAGAATCAGGGCGCGAGTGTGGACACTGTCCTGAAT 
GGTAGACGGTCAT CTTTGGACTACAT CTTAGTC C CGCGCTCACAC CTGTGACAGGACTTA 

121 CT CAATGTCTCAGTGTGTGCTGAAACATGTAGAAATT7WVGT CCATC CCT CCTACTCTAC 
GAGTTACAGAGTCACACACGACTTTGTACATCTTTAATTTCAGGTAGGGAGGATGAGATG 

181 TGGGATTGAGCCCCTTCCCTATCCCCCCCCAGGGGCAGAGGAGTTCCTCTCACTCCTGTG 
ACCCTAACTCGGGGAAGGGATAGGGGGGGGTCCCCGTCTCCTCAAGGAGAGTGAGGACAC 

241 GAGGAAGGAATGATACTTTGTTATTTTTCACTGCTGGTACTGAATCCACTGTTTCATTTG 
CTCCTTCCTTACTATGAAACAATAAAAAGTGACGACCATGACTTAGGTGACAAAGTAAAC 

3 01 TTGGTTTGTTTGTTTTGTTTTGAGAGGCGGTTTCACTCTTGTTGCTCAGGCTGGAGGGAG 
AACCAAACAAACAAAACAAAACTCTCCGCCAAAGTGAGAACAACGAGTCCGAC 



3 61 TGCAATGGCGCGATCTTGGCTTACTGCAGCCTCTGCCTCCCAGGTTCAAGTGATTCTCCT 
ACGTTACCGCGCTAGAACCGAATGACGTCGGAGACGGAGGGTCCAAGTTCACTAAGAGGA 

alu 

421 GCTTCCGCCTCCCATTTGGCTGGGATTACAGGCACCCGCCACCATGCCCAGCTAATTTTT 
CGAAGGCGGAGGGTAAACCGACCCTAATGTCCGTGGGCGGTGGTACGGGTCGATTAAAAA 



481 TGTATTTTTAGTAGAGACGGGGGTGGGGGTGGGGTTCACCATGTTGGCCAGGCTGGTCTC 
ACATAAAAATCATCTCTGCCCCCACCCCCACCCCAAGTGGTACAACCGGTCCGACCAGAG 

CAP 



************************************** 

541 GAACTTCTGACCTCAGATGATCCACCTGCCTCTGCCTCCTAAAGTGCTGGGATTACAGGT 
CTTGAAGACTGGAGT CTACTAGGTGGACGGAGACGGAGGATTTCACGACCCTAATGT CCA 



601 GTGAGCCACCATGCCCAGCTCAGAATTTACTCTGTTTAGAAACATCTGGGTCTGAGGTAG 
CACTCGGTGGTACGGGTCGAGTCTTAAATGAGACAAATCTTTGTAGACCCAGACTCCATC 

CCAAT 
★#★*★★★**★★****> 

CTTCGAGTGGGGTGAGTTCACAACACCACAAAACT 

721 TGTTAGAACACTCTTGATGTTTTACACTGTGATGACTAAGACATCATCAGCTTTTCAAAG 
ACAATCTTGTGAGAACTACAAAATGTGACACTACTGATTCTGTAGTAGTCGAAAAGTTTC 
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CAP 

***************> 

781 ACACACTAACTGCAC C CATAATACTGGGGTGT CTTCTGGGTATCAGCGATCTTCATTGAA 
TGTGTGATTGACGTGGGTATTATGAC C CCACAGAAGACC CATAGT CGCTAGAAGTAACTT 

CAP 

************ 

841 TGCCGGGAGGCGTTTCCTCGCCATGCACATGGTGTTAATTACTCCAGCATAATCTT 

ACGGCCCTCCGCAAAGGAGCGGTACGTGTACCACAATTAATGAGGTCGTATTAGAAGACG 



• * * * > 

901 TTCCATTTCTTCTCTTCCCTCTTTTAAAATTGTGTTTTCTATGTTGGCTTCTCTGCAGAG 
AAGGTAAAGAAGAGAAGGGAGAAAATTTTAACACAAAAGATACAACCGAAGAGACGTCTC 

CAP 

***************> 

961 AACC^GTGTAAGCTACAACTTAACTTTTGTTGGAACAAATTTTCCAAACCGCCCCTTTGC 
TTGGTGACATTCGATGTTGAATTGAAAACAACCTTGTTTAAAAGGTT^ 

1021 CCTAGTGGCAGAGACAATTCACAAACACAGCCCTTTAAAAAGGCTTAGGGATCACTAAGG 
GGATCACCGTCTCTGTTAAGTGTTTGTGTCGGGAAATTTTTCCGAATCCCTAGTGATTCC 

1081 GGATTTCTAGAAGAGCGAC C CGTAATC CTTAAGTATTT ACAAGACGAGGCTAACCTCCAG 
CCTAAAGATCTTCTCGCTGGGCATTAGGAATTCATAAATGTTCTGCTCCGATTGGAGGTC 

1141 CGAGCGTGACAGCCCAGGGAGGGTGCGAGGCCTGTTCAAATGCTAAGCTTCCATAAATAA 
GCTCGCACTGTCGGGTCCCTCCCACGCTCCGGACAAGTTTACGATTCGAAGGTATTTATT 

1201 AGCAAATTTCCTCCGGCAGTTTCTGGAAAGTAGGAAAGGTTAACATTTAAGGTTGCGTTT 
TCGTTTAAAGGAGGCCGTCAAAGACCTTTCATCCTTTCCAATTGTAAATTCCAACGCAAA 

1261 GTTAGCATTTCAGTGTTTGCCGACCTCAGCTAACAGCATCCCTGCAAGGCCTCGGGAGAC 
CAAT CGTAAAGTCACAAACGGCTGGAGTCGATTGTCGTAGGGACGTTC CGGAGC C CT CTG 

13 21 CCAGAAGTTT CTCGC CCCTTAGAT C CAAACTTGAGCAACCCGGAGTCTGGATTC CTGGGA 
GGT CTTCAAAGAGCGGGGAATCTAGGTTTGAACT CGTTGGGCCT CAGAC CTAAGGAC CCT 

TopoII 
*****************> 

13 81 AGTCCTCAGCTGTCCTGCGGTTGTGCCGGGGCCCCAGGTCTGGAGGGGACCAGTGGCCGT 
TCAGGAGTCGACAGGACGCCAACACGGCCCCGGGGTCCAGACCTCCCCTGGTCACCGGCA 

1441 GTGGCTTCTACTGCTGGGCTGGAAGTCGGGCCTCCTAGCTCTGCAGTCCGAGGCTTGGAG 
CACCGAAGATGACGACCCGACCTTCAGCCCGGAGGATCGAGACGTCAGGCTCCGAACCTC 



1501 CCAGGTGCCTGGACCCCGAGGCTGCCCTCCACCCTGTGCGGGCGGGATGTGACCAGATGT 
GGTCGACGGACCTGGGGCTCCGACGGGAGGTGGGACACGCCCGCCCTACACTGGTCTACA 

1561 TGGCCTCATCTGCCAGACAGAGTGCCGGGGCCCAGGGTCAAGGCCGTTGTGGCTGGTGTG 
ACCGGAGTAGACGGTCTGTCTCACGGCCCCGGGTCCCAGTTCCGGCAACACCGACCACAC 

1621 AGGCGCCCGGTGCGCGGCCAGCAGGAGCGCCTGGCTCCATTTCCCACCCTTTCTCGACGG 
TCCGCGGGCCACGCGCCGGTCGTCCTCGCGGACCGAGGTAAAGGGTGGGAAAGAGCTGCC 
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16 81 GACCGCCCCGGTGGGTGATTAACAGATATTGGGGTGGTTTGCTCATGGTGGGGACCCCTT 
CTGGCGGGGCCACCCACTAATTGTCTATAACCCGA.CCAAACGAGTACCACCCCTGGGGAA 

1741 CG C CGC CTGAGAAC CTGCAAAGAGAAATGACGGGCCTGTGT CAAGGAGCCCAAGTCGCGG 
GCGGCGGACTCTTGGACGTTTCTCTTTACTGCCCGGACACAGTTCCTCGGGTTCAGCGCC 

18 01 GGAAGTGTTGCAGGGAGGCACTCCGGGAGGTCCCGCGTGCCCGTCCAGGGAGCAATGCGT 
CCTTCACAACGTCCCTCCGTGAGGCCCTCCAGGGCGCACGGGCAGGTCCCTCGTTACGCA 

18 61 CCTCGGGTTCGTCCCCAGCCGCGTCTACGCGCCTCCGTCCTCCCCTTCACGTCCGGCATT 

GGAGCCCAAGCAGGGGTCGGCGCAGATGCGCGGAGGCAGGAGGGGAAGTGCAGGCCGTAA 

1 9 21 CGTGGTGCCCGGAGCCCGACGCCCCGCGTCCGGACCTGGAGGCAGCCCTGGGTCTCCGGA 

GCACCACGGGCCTCGGGCTGCGGGGCGCAGGCCTGGACCTCCGTCGGGACCCAGAGGCCT 

1981 TCAGGCCAGCGGCCAAAGGGTCGCCGCACGCACCTGTTCCCAGGGCCTCCACATCATGGC 
AGTCCGGTCGCCGGTTTCCCAGCGGCGTGCGTGGACAAGGGTCCCGGAGGTGTAGTACCG 

2041 CCCTCCCTCGGGTTACCCCACAGCCTAGGCCGATTCGACCTCTCTCCGCTGGGGCCCTCG 
GGGAGGGAGC C CAATGGGGTGT CGGATCCGGCTAAGCTGGAGAGAGGCGAC C CCGGGAGC 

Spl 

2101 CTGGCGTCCCTGCACCCTGGGAGCGCGAGCGGCGCGCGGGCGGGGAAGCGCGGCCCAGAC 
GACCGCAGGGACGTGGGACCCTCGCGCTCGCCGCGCGCCCGCCCCTTCGCGCCGGGTCTG 

2161 CCCCGGGTCCGCCCGGAGCAGCTGCGCTGTCGGGGCCAGGCCGGGCTCCCAGTGGATTCG 
GGGGCCCAGGCGGGCCTCGTCGACGCGACAGCCCCGGTCCGGCGCGAGGGTCACCTAAGC 

2221 CGGGCAACAGACGCCCAGGACCGCGCTTCCCACGTGGCGGAGGGACTGGGGACCCGGGCA 
GCCCGTTGTCTGCGGGTCCTGGCGCGAAGGGTGCACCGCCTCCCTGACCCCTGGGCCCGT 

Spl 



E2F 
**★+*++* 

22 81 CCGGTCCTGCCCCTTCACCTTCCAGCTCCGCCTCGTCCGCGCGGAACCCCGCCCCGTCCC 
GGCCAGGACGGGGAAGTGGAAGGTCGAGGCGGAGCAGGGGCGCCTTGGGGCGGGGCAGGG 

2341 GAACCCTTCCCGGGTCCCCGGCCCAGCCCCTTCCGGGCCATCCCAGCCCGTCCCGTTCCT 
CTTGGGAAGGGCCGAGGGGCCGGGTCGGGGAAGGCCCGGTAGGGTCGGGCAGGGCAAGGA 

Spl 



E2F * NFkB 

24 01 TTTCCGCGGCCCCGCCCTCTCCTCGCGGCGCGAGTTTCAGGCAGCGCTGCGTCCTGCTGC 
AAAGGCGCCGGGGCGGGAGAGGAGCGCCGCGCTCAAAGTCCGTCGCGACGCAGGACGACG 

hTRT5 1 

2461 GCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCCGCGCGCTCCCCGCTGCCGAG 
CGTGCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGGCGCGCGAGGGGCGACGGCTC 

2521 CCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTGC 
GGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGACGGCGACCGGTGCAAGCACG 
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E2F 

2581 GGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCG 
CCGCGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCCCCTGGGCCGCCGAAAGGCGC 



2641 CGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGCCC 
GCGACCACCGGGTCACGGACCAGACGCACGGGACCCTGCGTGCCGGCGGGGGGCGGCGGG 

NFkB 



2701 CCTCCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCG 
GGAGGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGCCGACCCCAACTCCCGCCGGC 

Topo_II_cleavag 



NFkB 
+++++ +++ + -*- 
NFkB 



Intronl 

2761 GGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGT 
CCCCCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAGTCCCGCGAAGGGGGCGTCCA 

e site 



2821 GTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAA 
CAGGACGGACTTCCTCGACCACCGGGCTCACGACGTCTCCGACACGCTCGCGCCGCGCTT 

2881 GAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGC 
CTTGCACGACCGGAAGCCGAAGCGCGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCG 



2941 CTTC^VCCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAG * 
GAAGTGGTGGTCGCACGCGTCGATGGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTC 

3 001 CGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGTGCTGGTTCACCTGCTGGC 
GCCCCGCACCCCCGACGACGACGCGGCGCACCCGCTGCTGCACGACCAAGTGGACGACCG 

3 061 ACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCC 
TGCGACGCGCGAGAAACACGACCACCGAGGGT CGACGCGGATGGTCCACACGC C CGGCGG 

3121 GCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCCACACGCTAGTGGACCCCG 
CGACATGGTCGAGCCGCGACGGTGAGTCCGGGCCGGGGGCGGTGTGCGATCACCTGGGGC 

3181 AAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCT 
TTCCGCAGACCCTACGCTTGCCCGGACCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGA 

3241 GGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTT 
CCCGGACGGTCGGGGCCCACGCTCCTCCGCGCCCCCGTCACGGTCGGCTTCAGACGGCAA 

33 01 GCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGG 
CGGGTTCTCCGGGTCCGCACCGCGACGGGGACTCGGCCTCGCCTGCGGGCAACCCGTCCC 
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3361 GTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCGTGGTTTCTGTGTGGTGTC 
CAGGACCCGGGTGGGCCCGTCCTGCGCACCTGGCTCACTGGCACCAAAGACACACCACAG 

3421 ACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCA 
TGGACGGTCTGGGCGGCTTCTTCGGTGGAGAAACCTCCCACGCGAGAGACCGTGCGCGGT 

3481 CTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCCATCCACATCGCGGCCACC 
GAGGGTGGGTAGG CACC CGGCGGTCGTGGTGCGC C CGGGGGGTAGGTGTAGCGCCGGTGG 

3 541 ACGTCCCTGGGACACGCCT.TGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTC 
TGCAGGGACCCTGTGCGGAACAGGGGGCCACATGCGGCTCTGGTTCGTGAAGGAGATGAG 

3 6 01 CTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAGCTCTCTGAGGCCCAGCCT 
GAGT CCGCTGTTC CTCGTCGACGC CGGGAGGAAGGATGAGTCGAGAGACTC CGGGT CGGA 

3 661 GACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGG 
CTGACCGCGAGCCTCCGAGCACCTCTGGTAGAAAGACCCAAGGTCCGGGACCTACGGTCC 

3 721 GACTCCCCGCAGGTTGCGCCGCCTGCCCCAGCGCTACTGGCAAATGCGGCCCCTGTTTCT 
CTGAGGGGCGTCCAACGGGGCGGACGGGGTCGCGATGACCGTTTACGCCGGGGACAAAGA 

3 781 GGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCC 
CCTCGACGAACCCTTGGTGCGCGTCACGGGGATGCCCCACGAGGAGTTCTGCGTGACGGG 

3 841 GCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCGGGAGAAGCCCCAGGGCTC 
CGACGCTCGACGCCAGTGGGGTCGTCGGCCACAGACACGGGCCCTCTTCGGGGTCCCGAG 

3 901 TGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCA 
ACACCGCCGGGGGCTCCTCCTCCTGTGTCTGGGGGCAGCGGACCACGTCGACGAGGCGGT 

3 961 GCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTGCCTGCGCCGGCTGGTGCC 

CGTGTCGTCGGGGACCGTCCACATGCCGAAGCACGCCCGGACGGACGCGGCCGACCACGG 

4 021 CCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTT 

GGGTCCGGAGACCCCGAGGTCCGTGTTGCTTGCGGCGAAGGAGTCCTTGTGGTTCTTCAA 



4 081 CATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCTGACGTGGAAGATGAGCGT , 
GT AGAGGGAC C CCTTCGTACGGTT CGAGAGCGACGTC CTCGACTGCAC CTTCTACTCGCA 



4141 GCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGTGAGGAGGTGGTGGCCGTCGAGGGCCC 
CGCCCTGACGCGAACCGACGCGTCCTCGGGTCCACTCCTCCACCACCGGCAGCTCCCGGG 

Intron2 

★★★★★★★★★★★★★it********************************************** 

4201 AGGC C C CAGAGCTGAATGCAGTAGGGGCTCAGAAAAGGGGGCAGGCAGAGC CCTGGTCCT 
TCCGGGGTCTCGACTTACGTCATCCCCGAGTCTTTTCCCCCGTCCGTCTCGGGACCAGGA 



4261 CCTGTCTCCATCGTCACGTGGGCACACGTGGCTTTTCGCTCAGGACGTCGAGTGGACACG 
GGACAGAGGTAGCAGTGCACCCGTGTGCACCGAAAAGCGAGTCCTGCAGCTCACCTGTGC 



4321 GTGATCGAGGTCGAC 
CACTAGCT CCAGCTG 
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gccaagttcctgcactggctgatgagtgtgtacgtcgtcgagctgctcaggtctttcttt 
tatgtcacggagaccacgtttcaaaagaacaggctctttttctaccggaagagtgtctgg 
agcaagttgcaaagcattggaatcagacagcacttgaagagggtgcagctgcgggacgtg 
tcggaagcagaggtcaggcagcatcgggaagccaggcccgccctgctgacgtccagactc 
cgcttcatccccaagcctgacgggctgcggccgattgtgaacatggactacgtcgtggga 
gccagaacgttccgcagagaaaagagggccgagcgtctcacctcgagggtgaaggcactg 
ttcagcgtgctcaactacgagcgggcgcg 

FIG. 23 



TCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAG 
ACCAGCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCC 
AGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATC 
AGGGGCAAGTC 
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pBB5212 pGRN133 




Internal Control 



Approximate Cell No. 5,000 5,000 5,000 5,000 
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NXT Heparin Affinity Glyc. grad. 
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Glyc. grad. 
fractions 
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Gradient 

FIG. 30 
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TELOMERASE 



TEMPLATE 

CCAAAACCCCAAAAC - 
tttT GGGG-3- 



PANEL A 



TELOMERASE 



TEMPLATE 

CCAAAACCCCAAAAC • 
G GTTTT-3 1 

-G^ 




PANEL B 
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1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG 

51 GTAGTTTAGA AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA 

101 TTTGGATGAT ATAGAAAATT TACTTCCTAA T AC ATT C AAC AAGTATAGCA 

151 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA 

2 01 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGTTCTACTT 

2 51 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 

3 01 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 

3 51 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA 

4 01 TTATGGAGAA AATTAC TTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 
451 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 

5 01 GATCTCGATA CATCAGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG 
551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 

6 01 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
651 TTGAGACAAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 

7 01 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 
751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT 

8 01 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATG CACAC TGAATTTATA 
851 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 
901 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT 

9 51 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 
10 01 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 
10 51 AAAGAAGAAG AGCTAT CACA ATCCTGATTC TTAAAGATTT CAAAAATTCC 
1101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG TTTTTCATTT 
1151 CACAGCTGTT ATTTTCTTTT AT CTTAAC AA TATTTTTTGA TTAGCTGGAA 
12 01 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 

12 51 TCACATTCAT AGATCGACCT TCATATATCC AATACGATGA TAAGGAAACA 

13 01 GCAGTCATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 

13 51 AAGAAATGGA GCCGAAATCT TAATCAAAAA GAATTGCGTC GATATTGCAA 

14 01 AAGAATCGAA CTCTAAATCT TTCGTTAATA AGT ATT AC C A ATCTTGATTG 

14 51 ATTGAAGAGA TTGACGAGGC AAC TGCACAG AAGATCATTA AAGAAATAAA 

15 01 GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG 

15 51 CGATCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 

16 01 ATACAAACCT TGGTCAAAAT ATTGAGGAAG GAAAAGAAGA CCAGTTAGCA 
16 51 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA 
1701 AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 
1751 GGGGTTTTGG GG 
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CC C CAAAAC C C CAAAACCCCAAAAC C C CTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 

+ + + + + + 

GGGGTTTTGGGGTTTTGGGGTTTTGGGGATATTTTTTTCTTTTTTAACTCCATCAAATCT 



PQNPKT P KP L * KKKKLR * FR 
PKTPKPQNPYKKRKNCGSLE- 
PKPQNPKTPIKKEKIEVV*K- 



' AATAAAATATTATT C C CGCACAAATGGAGATGGATATTGATTTGGATGATATAGAAAATT 

_i + + + + + 

TTATTTTATAATAAGGGCGTGTTTACCTCTACCTATAACTAAACCTACTATATCTTTTAA 



NKILFPHKWRWILIWMI*KI 
IKYYSRTNGDGYCFGCYRKF 
* N I I P A Q M E M D I D L D D IENL 



TACTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA 

+ + + + H 4- 

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACGTTTT 



YFLIHSTS IAALVVTRKDAK 
TS*YIQQV*QLL*CQERMQN- 
L PNT FN KY S S S C S D KKGC KT- 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG 

181 + + + + + + 240 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

HCNLARNRLHCLFQSCKNN* 
IEIWLEIAFIDYSKVAKTIR- 
LKSGSKS PSLTI PKLQKQLE- 



AGTTCTACTTCTCGGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA 

+ + + + + + 

TCAAGATGAAGAGC CTACGTTTAGAAATATTGCTAAGAAAGAACTCTTTTAAT CAAAATT 



SSTSRMQ IFITILSC'EN*F* 
VLLLGC KSL*RFFLEKISFK- 
F Y F ■ S - D A N L Y N*D S F L R K L V L K - 



AAAGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 

+ + + + + + 

TTTCGCCTCTCGTTTCTCATCTTTAACTTTGTAATGATTACAAATTTATTTTAGTCCATT 



KAESKE* KLKHY* CLNKIR* 
KRRAKS RNCNITNV* IKSGN 
SGEQRVEIETLLMFK*NQVM 



TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA 

+ - + + + + " + 

ACTCCTAATAAGATAAAAAATCTAGTGAAGAATTCCTCGTAATACCTCTTTTAATGAATT 



CGLFYFLDHFLRS IMEKIT* 
EDYSIF*ITS*GALWRKLLN 
R I ILFFRSL LKEHYGENYLI 
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TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATGAGTATATTAAATT 

421 + + + - + + + 48 

ATGATTTTCCATTTGTCAAACCTAATAAAGGGATCGGTTGTTACTACTCATATAATTTAA 

a Y*KVNSL>DYFPSQQCCVY*I 

b TKR* TVW I ISLANNDEYI KF 

c LKGKQ FGLFP*PTMMSILNS- 

CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 
481 + + + + + + 54 

GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 

a HMRMSQRISIHQTYQRQTRY 

b I CECVKGSRYIRLTKDKLAI - 

C YENES KDLDTSDLPKTNSL*- 

AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG 

541 + + + + + + go 

TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC 

a KTQEKVC* SNSRRTYCIYYS 

b KRKKKFDNRTAEEL IAFT I R 

C NARKS LI IEQQKNLLHLLFV- 

TATGGGTTTTATTACAATTGTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 

601 + + + + + + 66 

ATACCCAAAATAATGTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 

a YGFYYNCFRYRRCTPESCDN 

b MGFITIVLGIDGELPSLETI- 

C WVLLQLF *VSTVNSRVLRQL- 

TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAAAGTTCTGATGTGTATGCCAT 

661 + + + + + + 72 

ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 

a CKSCLQLKESQFCKFCCVCH 

b EKAVYNCRNRSSESSDVYAI- 

c KKLFTTEG IAVLKVLMCMPL- 

TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 

721 + + + + + + 78 

ATAAAACACTTAATTAGAGTTTATAGAATAGAGTTAAATTACCTATCGATATCTTTGTTT 

a YFVN*SQISYLNLMDSYRN K 

b ILCINLKYLISI*WIAIETN- 

c. FCELI SNILSQFNG* L* KQT- 

CC AAATAAAC CATGCAAGTTTAATGGAATATACGTTAAAT C CTTTGGGACAAATGCAC AC 

781 + + + + + + 84 

GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG 

a PNKPCKFNGIYVKS FGTNAH 

b Q INHA S LMEYTLNPLGQMHT 

C K* TMQV*WNIR* ILWDKCTL- 

TGAATTTATATTGGATTCTTAAAGCATAGATACACAGAATGCTTTAGAGACTGATTTAGC 
84! + + + + + + 90 

ACTTAAATATAACCTAAGAATTTCGTATCTATGTGTCTTACGAAATCTCTGACTAAATCG 

a C I YI GF L KHRYTE CF RDC F S 

b E F I LD S * S IDTQNALETDLA- 

C NLYWI LKA* IHRML* RL I * L - 
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TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 

901 + + + + + + 960 

AATGTTGTCTAATGGACAAAACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 

a LiQQITCFDYSCSSLI SLKEA 

b YNRLPVLITLAHLLYL* KKQ- 

C TTDYLFCLLLLISYI FKRSR- 

GGCGAAATGAAAAGAAGACTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 

961 + + - + + + + 1020 

CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 

a .GEMKRRLKKEI SKFVDSSVT 

b AKCKED* RKRFQN LLILL* P 

C RNEKKTKERD FKICCFFCNR- 

GGAATTAACAACAAGAATATTAGCAACGAAAAAGAAGAAGAGCTATCACAATCCTGATTC 

1021 + + + + + + 1080 

CCTTAATTGTTGTTCTTATAATCGTTGCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

a GIN NKNI SNEKEEELSQSCF 

b ELTTR I LATKKKKSYHNPDS 

c N*QQEY*QRKRRRAITILIL- 

TTAAAGATTTC AAAAATTC C AGGTAAGAGAGATACATTC ATTAAAATT CATATATTATAG 

1081 + + + + + + 1140 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 

a LKISKI PGKRDTFIKIHIL* 

b *RFQKFQVREIHSLKFIYYS 

c KDFKNSR* ERYIH*NSYI IV- 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 

1141 + + + + + + 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

a FFISQLLFSFILTIFFD*LE 

b FSFHSCYFLLS*QYFLISWK- 

C FHFTAVI F FYLNNI FCLAGS- 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACT.TAGCTTATTCACATTCAT 

1201 + + + - + + + 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 

a VKSIK*EKR*TEVT*LIHIH 

b *KVSNKRSARLR*LSLFTFI 

C KKYQIREALDCGNLAYSHS * - 

AG ATCGAC CTTC ATATAT C CAATACGATGATAAGGAAACAGCAGT C ATC CGTTTT AAAAA 

1261 + + + + + + 1320 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATTTTT 

a RSTFI.YPIRC*GNSSHPF*K - 
b DRPSYIQYDDKETAVIRFKN- 
c I DLHI SNTMI RKQQ S SVLKI- 

TAGTGCTATGAGGACTAAATTTTTAG AGT CAAGAAATGGAGC CGAAATCTTAAT CAAAAA 

1321 + + + - + + + 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT 

a *CYED* IFRVKKWSRNLNQK 

b SAMRTKFLESRNGAEILIKK- 

c VLCGLNF * SQEMEP KS * SKR- 
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GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 

1381 + + + + + + 1440 

CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 

a ELRR YCKRIEL* I F R * * ' V L P 

b- NCVDIAKE SNSKS FVNKYYQ 

c IASILQKNRTLNLSL I S I T N - 

ATCTTGATTGATTGAAGAGATTGACGAGGCAACTGCACAGAAGATCATTAAAGAAATAAA 

1441 + + + + -- + + 1500 

TAGAACTAACTAACTTCTCTAACTGCTCCGTTGACGTGTCTTCTAGTAATTTCTTTATTT 

a I L I DCRDCRGNCTEDH * RNK 

b SCLIEEIDEATAQKI IKEIK - 

C LDCLKRLTRQLHRRSLKK* S - 

GTAACTTTTATTAATTAGAGAATAAACTAAATTACTAATATAGAGATCAGCGATCTTCAA 

1501 + + + + + + 1560 

CATTGAAAATAATTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTAGAAGTT 

a VTFIM*RIN*ITNIEISDLQ 

b *LLLIRE*TKLLI*RSAIFN- 

C NFY*LENKLNY*YRDQRSS I- 

TTGACGAAATAAAAGCTGAACTAAAGTTAGACAATAAAAAATACAAACCTTGGTCAAAAT 

1561 + + + + + + 1620 

AACTGCTTTATTTTCGACTTGATTTCAATCTGTTATTTTTTATGTTTGGAACCAGTTTTA 

a LTK* KLN* S *TI KNTNLGQN 

b CRNKSCTKVRQ*KIQTLVKI 

c DEIKAELKLDNKKYKPWSKY- 

ATTGAGGAAGGAAAAGAAGACCAGTTAGCAAAAGAAAAAATAAGGCAATAAATAAAATGA 

1621 + + - --- + + + + 1680 

TAACTCCTTCCTTTTCTTCTGGTCAATCGTTTTCTTTTTTATTCCGTTATTTATTTTACT 

a IEEGKEDQLAKEKIR Q* IKC 

b LRKEKKTS *QKKK* GNK*NE 

c CGRKRR PVSKRKNKAI NKM S- 

GTACAGAAGTGAAGAAATAAAAGATTTATTTTTTTCAATAATTTATTGAAAAGAGGGGTT 

1681 + + + + + + 1740 

CATGTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTATTAAATAACTTTTCTCCCCAA 

a VQKCRNKRFIFFNNLLKRGV 

b YRSEEIKDLFFSI IYCKEGF- 

c TEVKK*KIYFFQ*FIEKRGF- 

TTGGGGTTTTGGGGTTTTGGGG 

1741 + + 1762 

AACCCCAAAACCCCAAAACCCC 

a LGFWGFG 
b WGFGVLG 
C GVLGFW 
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MEIENNQAQQPKAEKLWWELELEMQENQNDIQVRVKIDDPKQYL 
VNVTAACLLQEGSYYQDKDERRYIITKALLEVAESDPEFICQLA 
VY I RNEL Y I RTTTNY I VAFC WHKNTQ P F I EKYFNKAVLL PNDL 
LEVCEFAQVLYIFDATEFKNLYLDRILSQDIRKELTFRKCLQRC 
VRSKFSEFNEYQLGKYCTESQRKKTMFRYLSVTNKQKWDQTKKK 
RKENLLTKLQAIKESEDKSKRETGDIMNVEDAIKALKPAVMKKI 
AKRQNAMKKHMKAPKIPNSTLESKYLTFKDLIKFCHISEPKERV 
YKI LGKKYPKTEEEYKAAFGDS AS APFNPELjAGKRMKI E I S KTW 
ENELSAKGNTAEVWDNLISSNQLPYMAMLRNLSNILKAGVSDTT 
HSIVINKICEPKAVENSKMFPLQFFSAIEAVNEAVTKGFKAKKR 
ENMNLKGQIEAVKEWEKTDEEKKDMELEQTEEGEFVKVNEGIG 
KQ Y INS I ELAI KI AVNKNLDE I KGHTAI FSDVSGSMSTSMSGGA 
KKYGSVRTCLECALVLGLMVKQRCEKSSFYIFSSPSSQCNKCYL 
EVDLPGDELRPSMQKLLQEKGKLGGGTDFPYECIDEWTKNKTHV 
DNIVILSDMMIAEGYSDINVRGSS IVNSIKKYKDEVNPNIKIFA 
VDLEGYGKCLNLGDEFNENNYI KI FGMSDS ILKF I S AKQGGANM 
VEVIKNFALQKIGQK 

FIG. 43 

MS RRNQKKPQAP I GNETNLDFVLQNLE VYKSQ I EHYKTQQQQ I K 
EEDLKLLKFKNQDQDGNSGNDDDDEENNSNKQQELLRRVNQIKQ 
QVQLIKKVGSKVEKDLNLNEDENKKNGLSEQQVKEEQLRTITEE 
QVKYQNLVFNMDYQLDLNESGGHRRHRRETDYDTEKWFEISHDQ 
KNYVS I YANQKTSYCWWIiKDYFNKNNYDHLNVS INRLETEAEFY 
AFDDFSQTIKLTNNSYQTVNIDVNFDNNLCILALLRFLLSLERF 
NILNIRSSYTRNQYNFEKIGELLETIFAWFSHRHLQGIHLQVP 
CEAFQ YLVNS SSQI S VKDSQLQ VYS FS TDLKLVDTNKVQDY FKF 
LQEFPRLTHVSQQAI PVSATNAVENLNVLLKKVKHANLNLVS I P 
TQFNFDFYFVNLQHLKLEFGLEPNILTKQKLENLLLSIKQSKNL 
KFLRLNFYTYVAQETSRKQILKQATTIKNLKNNKNQEETPETKD 
ETPSESTSGMKFFDHLSELTELEDFSVNLQATQEIYDSLHKLLI 
RSTNLKKFKLSYKYEMEKSKMDTFIDLKNIYETLNNLKRCSVNI 
SNPHGNISYELTNKDSTFYKFKLTLNQELQHAKYTFKQNEFQFN 
NVKSAKIESSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPN 
NIQKNPFNKPNLLFFKQFEQLKNLENVSINCILDQHILNSISEF 
LEKNKKIKAFILKRYYLLQYYLDYTKLFKTLQQLPELNQVYINQ 
QL E ELTVSE VHKQVWENHKQ KAF YE PLCE F I KE S SQTLQL I D FD 
QNTVSDDSIKKILESISESKYHHYLRLNPSQSSSLIKSENEEIQ 
ELLKACDEKGVLVKAYYKFPLCLPTGTYYDYNSDRW 

FIG. 45 

MKILFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 
PNSRKIALPCLPGDLSHKAVIDHCIIYLLTGELYNNVLTFGYKI 
ARNED VNNS L FCH S ANVNVTLL KGAAW KMFHS LVGT YAF VDLL I 
NYTVIQFNGQFFTQ IVGNRCNE PHLPPKWVQRS SS S SATAAQ I K 
QLTEPVTNKQFLHKLNINSSSFFPYSKILPSSSSIKKLTDLREA 
I F PTNLVKI PQRLKVRINLTLQKLLKRHKRLNYVS ILNS I CP PL 
EGTVLDLSHLSRQS PKERVLKF 1 1 VI LQKLLPQEMFGS KKNKGK 
IIKNLNLLLSLPLNGYLPFDSLLKKLRLKDFRWLFISDIWFTKH 
NFENLNQLAICF IS WLFRQLI PKI IQTFFYCTE I S STVT I VYFR 
HDTWNKLITPFIVEYFKTYLVENNVCRNHNSYTLSNFNHSKMRI 
IPKXSNNEFRIIAIPCRGADEEEFTIYKENHKNAIQPTQKILEY 
LRNKRPTS FTKI YS PTQ I ADR I KE FKQRLLKKFNNVLPELYFMK 
FDVKSCYDSIPRMECMRILKDALKNENGFFVRSQYFFNTNTGVL 
KL FNWNAS RV P KP YE L Y I DNVRTVHL SNQD V I NWE ME I F KT A 
LWVEDKCYIREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS 
QDTLILKLADDFLI ISTDQQQVTNIKKLAMGGFQKYNAKANRDK 
ILAVSSQSDDDTVIQFCAMHIFVKELEVWKHSSTMNNFHIRSKS 
SKGIFRSLIAL FNTR I S YKT I DTNLNS TNTVLMQ I DHWKN I S E 
C YKS AFKDLS INVTQNMQFHS FLQR 1 1 EMTVSGCP I TKCD PL I E 
YEVRFTILNGFLESLSSNTSKFKDNI ILLRKEIQHLQAYIYI YI 
HIVN 



FIG. 46 




52/103 



1 tcaatactat taattaataa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 
61 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tattacaaaa 
121 tctagaagtt tacaaaagcc agattgagca ttataagacc tagtagtaat agatcaaaga 
181 ggaggatctc aagcttttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 
241 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 
301 gattaagtag caagtttaat tgataaaaaa agttggttct aaggtagaga aagatttgaa 
3 61 tttgaacgaa gatgaaaaca aaaagaatgg actttctgaa tagcaagtga aagaagagta 
421 attaagaacg attactgaag aataggttaa gtattaaaat ttagtattta acatggacta 
481 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga 
541 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 
601 ctaaaagaca tcatattgtt ggtggcttaa agattatttt aataaaaaca attatgatca 
661 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ttgatgattt 
721 ttcacaaaca atcaaactta ctaataattc ttactagact gttaacatag acgttaattt 
781 tgataataat ctctgtatac tcgcattgcc tagattttta ttatcactag aaagattcaa 
841 tattttgaat ataagatctt cttatacaag aaattaatat aattttgaga aaattggtga 
901 gctacttgaa actatcttcg cagttgtctt ttctcatcgc cacttacaag gcattcattt 
961 acaagttcct tgcgaagcgt tctaatattt agttaactcc tcatcataaa ttagcgttaa 
1021 agatagctaa ttataggtat actctttctc tacagactta aaattagttg acactaacaa 
1081 agtccaagat tattttaagt tcttataaga attccctcgt ttgactcatg taagctagta 
1141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 
1201 caagcatgct aatcttaatt tagtttctat ccctacctaa ttcaattttg atttctactt 
1261 tgttaattta taacatttga aattagagtt tggattagaa ccaaatattt tgacaaaaca 
1321 aaagcttgaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 
13 81 aaacttttac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 
1441 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 
1501 aactccaagc gaaagcacaa gtggtatgaa attttttgat catctttctg aattaaccga 
1561 gcttgaagat ttcagcgtta acttgtaagc tacccaagaa atttatgata gcttgcacaa 
1621 acttttgatt agatcaacaa atttaaagaa gttcaaatta agttacaaat atgaaatgga 
1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 
1741 taaaagatgc tctgttaata tatcaaatcc tcatggaaac atttcttatg aactgacaaa 
1801 taaagattct actttttata aatttaagct gaccttaaac taagaattat aacacgctaa 
1861 gtatactttt aagtagaacg aattttaatt taataacgtt aaaagtgcaa aaattgaatc 
1921 ttcctcatta gaaagcttag aagatattga tagtctttgc aaatctattg cttcttgtaa 
1981 aaatttacaa aatgttaata ttatcgccag tttgctctat cccaacaata tttagaaaaa 
2041 tcctttcaat aagcccaatc ttctattttt caagcaattt gaataattga aaaatttgga 
2101 aaatgtatct atcaactgta ttcttgatca gcatatactt aattctattt cagaattctt 
2161 agaaaagaat aaaaaaataa aagcattcat tttgaaaaga tattatttat tacaatatta 
2221 tcttgattat actaaattat ttaaaacact tcaatagtta cctgaattaa attaagttta 
2281 cattaattag caattagaag aattgactgt gagtgaagta cataagtaag tatgggaaaa 
2341 ccacaagcaa aaagctttct atgaaccatt atgtgagttt atcaaagaat catcctaaac 
2401 cctttagcta atagattttg accaaaacac tgtaagtgat gactctatta aaaagatttt 
2461 agaatctata tctgagtcta agtatcatca ttatttgaga ttgaacccta gttaatctag 
2521 cagtttaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 
2581 aggtgtttta gtaaaagcat actataaatt ccctctatgt ttaccaactg gtacttatta 
2641 cgattacaat tcagatagat ggtgattaat taaatattag tttaaataaa tattaaatat 
2701 tgaatatttc tttgcttatt atttgaataa tacatacaat agtcattttt agtgttttga 
2761 atatatttta gttatttaat tcattatttt aagtaaataa ttatttttca atcatttttt 
2821 aaaaaatcg 
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Oxytricha 
Euplotes 



LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
LCVSSILSSFYYATLEESSLGFLRDESMNPENPNWLLMRLT 
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ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 

GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 

TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 

TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 

ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 

TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 

GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 

TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 

GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 

ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 

TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 

GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 

TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 

GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 

ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTAGCCCA 

AGAAATGTTTGGCT CAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAAT CTTTTATT 

AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 

GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 

GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 

ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 

TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 

CGAAAAGAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 

AATGAGGATTAT AC CAAAAAAAAGT AAT AATG AGTT C AGGATT AT TGC C ATC C CATG CAG 

AGGGG CAGACG AAGAAGAATT CAC AATTT AT AAGGAGAAT CACAAAAATGCT AT C CAGC C 

CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 

TT CT C CAACGC AAATAG CTGAC CGT AT C AAAGAATTTAAG C AGAGACTTTTAAAGAAATT 

TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 

CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 

TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 

TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 

TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 

GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 

TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 

CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 

AGAC C AACAGC AAGTGAT C AAT AT CAAAAAG CTTG C CATGGGCGGAT TT CAAAAATATAA 

TGCGAAAGC CAATAGAGACAAAATTTTAGCCGTAAGCT CC CAAT CAGATGATGATACGGT 

TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 

CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 

AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 

CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 

TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 

CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 

TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 

ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 
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AKFLHWLMSVYWELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKR 
VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYWGARTFRREK 
RAERLTSRVKALFSVLNYERA 
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GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 
TTTCTTTTATGTCACGGAGACCACGTTTCA?VAAGAACAGGCTCTTTTTCTACC 
GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAG 
AGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGC 
CAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACGGGC 
TGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGA 
GAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCT 
CAACTACGAGCGGGCGCG 
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MTEHHT PKSR I LRFLENQ YVYLCTLND YVQLVLRGS PAS S YSNI CERLRSDVQTS FS I FLHSTWGF 
DSKPDEGVQFSSPKCSQSELIANVVKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLVSTF 
PNYL IS I LES KNWQLLLE I IGSDAMHYLLS KGS I FEALPNDNYLQ I SGI PLFKNNVFEETVSKKRKR 
TIETS I TQNKS ARKE VS WNS I S I SRFS I F YRSS YKKFKQDL YFNLHS I CDRNTVHMWLQWI FPRQFG 
LINAFQVKQLHKVIPLVSQSTWPKRLLKVYPLIEQTAKRLHRISLSKVYNHYCPYIDTHDDEKILS 
YSLKPKQVFAFLRSILVRVFPKXtlWGNQRIFEI ILKDLETFLKLSRYESFSLHYLMSNIKISEIEWL 
VLGKRSNAKMCLSDFEKRKQI FAEF I YWLYNS FI I P I LQS FFYITESSDLRNRTVYFRKDIWKLLCR 
PFITSMKMEAFEKINENNVRMDTQKTTLPPAVIRLLPKKOT 

QTLRPVASILKHLINEESSGIPFNLEVYMKLLTFKKDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLM 
FRIVKKKLKDPEFVIRKYATIHATSDRATK^FVSEAFSYFDWPFEKWQLLSMKTSDTLFVDFTO 
WTKSSSEIFK^LKEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 
LRVVDDFLFITVNKKDAKXFLNLSLiRGFEKHNFSTSLEKTVINFENSNGI INNTFFNESKKRMPFFG 
FSVNMRS LDTLLAC PKIDEALFNSTS VELTKHMGKS FFYKI LRS S LAS FAQW ID I THNS KFNSCCN 
■ IYRLGYSMCMRAQAYLKRMKX)IFIPQRMFITDLLNVIGRKIWKXLAEILGYTSRRFLSSAEVKWLFC 
LGMRDGLKPS FKYHPCFEQL I YQFQSLTDL I KPLRPVLRQVLFLHRRI AD 
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Wild Type Teiomerase Gene 
Wild Type Teiomerase Gene 



1 . Transform with linear fragment 
containing the teiomerase gene 
disrupting with a LEU2 or ura4 marker 



M 



Teiomerase Gene::M 



Wild Type Teiomerase Gene 




Wild Type Teiomerase Gene 



Wild Type (X2) 



2. Assay in selective media 



3. Sporulate, and grow on 
selective media 




Senescence? (X2) 



(These cells will show a senescence phenotype 

if the disrupted gene encodes a teiomerase subunit.) 
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heterozygote diploid 
wt diploid 
wt haploid 



Tetrad 




Generations 
after sporulation 



Telomeric 
Apa I 
fragment 




600 bp 
500 bp 
400 bp 

300 bp 
200 bp 

100 bp 
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i 

met ser val tyr val vai glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAC GTC GTC GAG CTG CTC 

10 20 
arg ser phe phe tyr val thr glu thr thr phe gin lys asn arg 
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gin ser ile 
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 
gly ile arg gin his leu lys arg val gin leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

60 

glu ala glu val arg gin his arg glu ala arg pro ala leu leu 
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70 80 
thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100 110 

glu lys ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

120 

ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gin asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 

160 170 

phe val lys val asp val thr gly ala tyr asp thr ile pro gin 
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 

180 

asp arg leu thr glu val ile ala ser ile ile lys pro gin asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 

190 200 

thr tyr cys val arg arg tyr ala val val gin lys ala ala met 
ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC ATG 
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210 

gly thr ser ala arg pro ser arg ala thr ser tyr val gin cys 
GGC ACG TCC GCA AGG CCT TCA AGA GCC ACG TCC TAC GTC CAG TGC 

220 230 

gin' gly ile pro gin gly ser ile leu ser thr leu leu cys ser 
CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC TGC AGC 



240 

leu. cys tyr gly asp met glu asn lys leu phe ala gly ile arg 
CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG ATT CGG 



250 

arg asp gly leu leu 
CGG GAC GGG CTG CTC 



thr pro his leu thr 
ACA CCT CAC CTC ACC 

280 

arg gly val pro glu 
CGA GGT GTC CCT GAG 



val val asn phe pro 
GTG GTG AAC TTC CCT 

310 

phe val gin met pro 
TTT GTT CAG ATG CCG 



leu arg leu val asp 
CTG CGT TTG GTG GAT 

270 

his ala lys thr phe 
CAC GCG AAA ACC TTC 



tyr gly cys val val 
TAT GGC TGC GTG GTG 

300 

val glu asp glu ala 
GTA GAA GAC GAG GCC 



ala his gly leu phe 
GCC CAC GGC CTA TTC 



260 

asp phe leu leu val 
GAT TTC TTG TTG GTG 



leu arg thr leu val 
CTC AGG ACC CTG GTC 

290 

asn leu arg lys thr 
AAC TTG CGG AAG ACA 



leu gly gly thr ala 
CTG GGT GGC ACG GCT 

320 

pro trp cys gly leu 
CCC TGG TGC GGC CTG 



leu leu asp thr arg 
CTG CTG GAT ACC CGG 

340 

tyr ala arg thr ser 
TAT GCC CGG ACC- TCC 



phe lys ala gly arg 
TTC AAG GCT GGG AGG 

370 

arg leu lys cys his 
CGG CTG AAG TGT CAC 



330 

thr leu glu val gin 
ACC CTG GAG GTG CAG 



ile arg ala ser leu 
ATC AGA GCC AGT CTC 

360 

asn met arg arg lys 
AAC ATG CGT CGC AAA 



ser leu phe leu asp 
AGC CTG TTT CTG GAT 



ser asp tyr ser ser 
AGC GAC TAC TCC AGC 

350 

thr phe asn arg gly 
ACC TTC AAC CGC GGC 



leu phe gly val leu 
CTC TTT GGG GTC TTG 

380 

leu gin val asn ser 
TTG CAG GTG AAC AGC 



leu gin thr val cys 
CTC CAG ACG GTG TGC 

400 

ala tyr arg phe his 
GCG TAC AGG TTT CAC 



390 

thr asn ile tyr lys 
ACC AAC ATC TAC AAG 



ala cys val leu gin 
GCA TGT GTG CTG CAG 



ile leu leu leu gin 
ATC CTC CTG CTG CAG 

410 

leu pro phe his gin 
CTC CCA TTT CAT CAG 
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420 

din val trp lys asn pro his phe ser cys ala ser ser leu thr 
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 440 n 

arg leu pro leu leu leu his pro glu ser gin glu arg arg asp 

CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG GAT 

450 

val ala gly gly gin gly arg arg arg pro ser ala leu arg gly 
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC. TCT GCC CTC CGA GGC 



460 — , 

arg ala val ala val pro pro ser ile pro ala gin ala asp ser 
CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

480 

thr pro" cys his leu arg ala thr pro gly val thr gin asp ser 
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490 500 

pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 
CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 

510 

pro qly gly arg ser gin pro gly thr ala leu arg leu gin asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 n 

his pro gly leu met ala thr arg pro gin pro gly arg glu gin . 

CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gin gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arq gly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 

CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 

AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 

CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 

TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 

AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 

GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 

TTT CAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
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1 GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC 

51 CCGCGATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 

101 AGCCACTACC GCGAGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 

151 GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 

201 CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN ANGGCNGCCC 

251 CCCGCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 

'3 01 CCGAGTGCTG CANANGCTGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT 

3 51 TCGGCTTCGC GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CGAGGC CTT C 
401 ACCACCAGCG TGCGCAGCTA CCTGCCCAAC ACGGTGACCG ACGCACTGCG 

4 51 GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC 
501 TGGTTCACCT GCTGGCACGC TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC 
551 TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 
6 01 TCAGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC 
651 CAACGGGCCT GGAAC CAT AG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG 
701 CCAGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 
751 GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 
8 01 C CGTTGGGC A GGGGTCCTGG GCCCACCCGG GCAGGACGCC TGGACCGAGT 

8 51 GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAGCCAC 

9 01 CTCTTTGGAG GGTGCGCTCT CTGGCACGCG CCACTCCCAC CCATCCGTGG 
951 GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCT 

10 01 GGGACACGCC TTGTCCCCCG GTGTACGCCG AGACCAAGCA CTTCCTCTAC 

1051 TCCTCAGGCG ACAAGNACAC TGCGNCCCTC CTTCCTACTC AATATATCTG 

1101 AGGCCCAGCC TGACTGGCGT TCGGGAGGTT CGTGGAGACA NTCTTTCTGG 

1151 TTCCAGGCCT TGGATGCCAG GATTCCCCGC AGGTTGCCCC GCCTGCCCCA 

12 01 GCGNTACTGG CAAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAAC CACG 
1251 CGCAGTGCCC CTACGGGGTG TTCCTCAAGA CGCACTGCCC GCTGCGAGCT 

13 01 GCGGTCACCC CAGCAGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 

13 51 TGTGGCGGCC CCCGAGGAGG AGGAACACAG ACCCCCGTCG C CTGGTGC AG 

14 01 CTGCTCCGCC AG C AC AG C AG CCCCTGGCAG GTGTACGGCT TCGTGCGGGC 

14 51 CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG 

15 01 AACGCCGCTT CCTCAGGAAC AC CAAGAAGT TCATCTCCCT GGGGAAGCAT 
1551 GCCAAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG 

16 01 CGCTTGGCTG CGCAGGAGCC CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC 

16 51 ACCGTCTGCG TGAGGAGATC CTGGCCAAGT TCCTGCACTG GCTGATGAGT 

17 01 GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA CGGAGACCAC 
1751 GTTTCAAAAG AACAGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCAAGT 

18 01 TGCAAAGCAT TGGAATCAGA CAGCACTTGA AGAGGGTGCA GCTGCGGGAG 

18 51 CTGTCGGAAG CAGAGGT C AG GC AG CAT CGG GAAGCCAGGC CCGCCCTGCT 

19 01 GACGTCCAGA CTCCGCTTCA TCCCCAAGCC TGACGGGCTG CGGCCGATTG 
1951 TGAACATGGA CTACGTCGTG GGAG C C AG AA CGTTCCGCAG AGAAAAGAGG 
2 0 01 GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTT CAGCG TGCTCAACTA 
2 0 51 CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 
2101 ACGATATCCA CAGGGCCTGG CGCACCTTCG TGCTGCGTGT GCGGGCCCAG 
2151 GACCCGCCGC CTGAGCTGTA CTTTGTCAAG GTGGATGTGA CGGGCGCGTA 
22 01 CGACACCATC CCCCAGGACA GGCTCACGGA GGTCATCGCC AGC AT CAT C A 

22 51 AACCCCAGAA CACGTACTGC GTGCGTCGGT ATGCCGTGGT CCAGAAGGCC 

23 01 GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGCCACGTCT CTACCTTGAC 

23 51 AGACCTCCAG CCGTACATGC GACAGTTCGT GGCTCACCTG CAGGANAACA 

24 01 GCCCGCTGAG GGATGCCGTC GT CAT CGAG C AGAGCTCCTC CCTGAATGAG 
2451 GCCAGCAGTG GCCTCTTCGA CGTCTTCCTA CGCTT CATGT GCCACCACGC 
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25 01 CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGCCAGGGG ATCCCGCAGG 
2 551 GCTCCATCCT CTCCACGCTG CTCTGCAGCC TGTGCTACGG CGACATGGAG 

26 01 AACAAG CTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 
26 51 GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 
2701 TCAGGACCCT GGTCCGAGGT GTCCCTGAGT ATGGCTGCGT GGTGAACTTG 
2751 CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC 
28 01 GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 
2851 TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC C AG CTATG C C 
2 901 CGGACCTCCA TCAGAGCCAG TCTCACCTTC AACCGCGGCT TCAAGGCTGG 

2 951 GAGGAACATG CGT CGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 

3 0 01 GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGCACCAAC 
3 051 ATCTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT 
3101 GCAGCTCCCA TTTCATCAGC AAGTTTGGAA GAACCCCACA TTTTTCCTGC 
3151 GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT GAAAGCCAAG 
3 2 01 AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 
3251 CGAGGC CGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 
3 3 01 GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCCAG 

33 51 ACGCAG CTGA GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC 

34 01 CGCAGCCAAC CCGGCACTGC CCTCAGACTT CAAGACC AT C CTGGACTGAT 
3 451 GGCCACCCGC CCACAGCCAG GCCGAGAGCA GACACCAGCA GCCCTGTCAC 
3 501 GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 
"3 551 ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 
3 601 CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 
3 651 GGGCTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 
3 701 CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CACCAGGAGC CCGGCTTCCA 
3 751 CTCCCCACAT AGGAATAGT C CATCCCCAGA TTCGCCATTG TTCACCCCTC 
3 8 01 GCCCTGCCCT CCTTTGCCTT CCACCCCCAC CATCCAGGTG GAG AC C CTGA 
3 851 GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGAC CAAAG GTGTGCCCTG 
3 901 TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGTCAAATT 

3 951 GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT 

4 001 TTGAAAAAAA AAAAAAAAAA AAAAAAAAA 
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GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCC 



X + + + + + + 60 

CGTCGCGACGCAGGACGACGCGTGCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGG 

a AALRPAAHVGS PGPGHPRDA 

b QRCVLLRTWEALAPATPAMP- 

C SAASCCARGKPWPRPPPRCR- 

GCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCT 

61 + -- + + + + + 120 

CGCGCGAGGGGCGACGGCTCGGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGA 

a ARSPLPSRALPAAQPLPRGA 

b RAPRCRAVRSLLRSHYREVL- 

c ALPAAE PCAPCCAATTARCC- 

GCCGCTGGCCACGTTCGTGCGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGG 

121 + + + + + + 180 

CGGCGACCGGTGCAAGCACGCCGCGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCC 

a AAGHVRAAPGAPGLAAGAAR 

b PLATFVRRLGPQGWRLVQRG- 

C RWPRS CGAWGPRAGGWC S A G - 

GGACCCGGCGGCTTTCCGCGCGNTGGTGGCCCAlsrrGCNTGGTGTGCGTGCCCTGGGANGN 

181 + + + + + + 240 

CCTGGGCCGCCGAAAGGCGCGCNACCACCGGGTNACGNACCACACGCACGGGACCCTNCN 

a GPGGFPR7GGP? 7GVRALG? 

b DPAAFRA?VA?C?VCVPW??- 

C TRRLSARWWP7AWCACPG? ?-. 

ANGGCNGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGTCCTGCCTGAANGANCTGGTGGC 

241 + + + + + + 300 

TNCCGNCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACAGGACGGACTTNCTNGACCACCG 

a ?AAPRRPLLPPGVLPE??GG 

b ??PPAAPSFRQVSCL??LVA- 

c G?PPPPPPSARCPA*??WWP- 

CCGAGTGCTGGANANGCTGTGCGANCGCGGCGCGAANAACGTGCTGGCCTTCGGCTTCGC 
301 + + + + + + 360 

GGCTCACGACGTNTNCGACACGCTNGCGCCGCGCTTNTTGCACGACCGGAAGCCGAAGCG 

a PSAA?AVR?RRE?RAGLRLR 

b. RVL? ?LC?RGA?NV LAFGFA- 

c EC t C??CA?AAR?TCWPSASR- 

GCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTA 

361 + + + + + + 420 

CGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCGGAAGTGGTGGTCGCACGCGTCGAT 

a AAGRGPRGPPRGLHHQRAQL 

b LLDGARGGPPEAFTTSVRSY- 

C CWTGPAGAPPRPSPPACAAT- 

CCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCG 

421 + + + + + + 480 

GGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTCGCCCCGCACCCCCGACGACGACGC 

a PAQHGDRRTAGERGVGAAAA 

b LPNTVTDALRGSGAWGLLLR 

C CPTR* PTHCGGAGRGGCCCA- 
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PRGRRRAGSPAGTLR? ? C A G 
RVGDDVLVHLLARCA? F V L V - 
AWATTCWFTCWHAAR? LCWW- 

GGNTCCCAGCTGCGCCTACCANGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCNAC 

541 + + + + + + 600 

- CCNAGGGTCGACGCGGATGGTNCACACGCCCGGCGGCGACATGGTCGAGCCGCGACGNTG 

GSQLRLP ?VRAAAVPARRC ? 
? PSCAY?VCGPPLYQLGAAT- 
? PAAPT ? CAGRRCTS SAL ? L- 

TCAGGCCCGGCCCCCGCCACACGCTANTGGACCCGAANGCGTCTGGGATCCAACGGGCCT 

601 + + + + + + 660 . 

AGTCCGGGCCGGGGGCGGTGTGCGATNACCTGGGCTTNCGCAGACCCTAGGTTGCCCGGA 

SGPAPATR?WTR?RLGSNGP 
QARPPPHA?GPE?VWDPTGL- 
RPGPRHTL?DP?ASGIQRAW- 

GGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCTGCCAGCCCCGGGTGCGAGGAG 

661 + + + + + + 720 

CCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGACCCGACGGTCGGGGCCCACGCTCCTC 

GTIASGRPGSPWAASPGCEE 
E p* RQGGRGPPGLPAPGA RR 
NHSVREAGVPLGCQPRVRGG- 

GCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGC 

721 + + + + + + 780 

CGCGCCCCCGTCACGGTCGGCTTCAGACGGCAACGGGTTCTCCGGGTCCGCACCGCGACG 

ARGQCQPKSAVAQEAQAWRC 
RGGS ASRSLPLPKRPRRGAA- 
AGAVPAEVCRCPRGPGVALP- 

CCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCC 

781 + + + + + + 840 

GGGACTCGGCCTCGCCTGCGGGCAACCCGTCCCCAGGACCCGGGTGGGCCCGTCCTGCGG 

P * AGADARWAGVLG P PGQDA. 
PEPERTPVGQGSWAH PGRTP 
LSRSGRPLGRGPGPTRAGRL- 

TGGACCGAGTGACCGTGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCAC 

841 + + + + + + 900 

ACCTGGCTCACTGGCACCAAAGACACACCACAGTGGACGGTCTGGGCGGCTTCTTCGGTG 

WTE * PWFLCG'VTCQTRRRSH 
GPSDRGFCVVS PARPAEEAT 
DRVTVVSVWCHLPDPPKKPP- 

CTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCA 

901 + + — + + + + 960 

GAGAAACCTCCCACGCGAGAGACCGTGCGCGGTGAGGGTGGGTAGGCACCCGGCGGTCGT 

LFGGCALWHAPLPP IRGPPA 
SLEGALSGTRHSHPSVGRQH- 
LWRVRSLARAT PTH PWAAST- 

CCACGCGGGCCCCCCATCCACATCGCGGCCACCACGTCCTGGGACACGCCTTGTCCCCCG 

961 + + + + + + 1020 

GGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGGTGCAGGACCCTGTGCGGAACAGGGGGC 

FIG. 72B 
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PRGPPIHIAATTS WDTPCPP 
HAGP PSTSRPPRPGTRLVPR - 
TRAP H P H RGHHVLG, HA L S P G - 

GTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGNACACT 

1021 + + + + + + 1080 

CACATGCGGCTCTGGTTCGTGAAGGAGATGAGGAGTCCGCTG 

VYAETKHFLYSSGDK?TA?L 
CTPRPS.TSS'TPQAT?TLRPS 
VRRDQALPLLLRRQ?HC? P P - 

CTTCCTACTCAATATATCTGAGGCCCAGCCTGACTGGCGTTCGGGAGGTO 

1081 + + + + + + H40 

GAAGGATGAGTTATATAGACTCCGGGTCGGACTGACCGCAAGCCCTCCAAGCACCTCTGT 

LPTQYI *GPA*LAFGRFVET 
FLLNISEAQPDWRSGGSWR? - 
SYS I YLRPSLTGVREVRGD?- 

OTCTTTCTGGTTCCAGGCCTTGGATGCCAGGATTCCCCGCAGGTTGCCCCGCCT 

1141 + + + + + + 1200 

NAGAAAGACCAAGGTCCGGAACCTACGGTCCTAAGGGGCGTCCAACGGGGCGGACGGGGT 

?FLVPG LGCQDSPQVAPPAP 
SFWFQALDARI PRRLPRLPQ - 
LSGSRPWMPGFPAGCPACPS- 

GCGNTACTGGCAAATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCC 

1201 + + + + + + 1260 

CGC^ATGACCGTTTACGCCGGGGACAAAG 

A? LANAAPVSGAAWE PRAV .P 
RYWQMRPLFLELLGNHAQCP - 
?TGKCGPCFWSCLGTTRSAP-. 

CTACGGGGTGTTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGG 

1261 + + + + + + 1320 

GATGCCCCACAAGGAGTTCTGCGTGACGGGCGACGCTCGACGCCAGTGGGGTCGTCGGCC 

LRGV P Q DAL PAAS C.GH P S S R 
YGVF L.KTH C P LRAAVT PAAG 
TGCS S RRTARCELRS PQQ PV- 

TGTCTGTGCCCGGGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGAACACAG 

1321 + + + + + + 1380 

ACAGACACGGGCCCTCTTCGGGGTCCCGAGACACCGCCGGGGGCTCCTCCTCCTTGTGTC 

CLCPGEAPGL'CGGPRGGGTQ.- 
VCAREKPQGSVAAPEEEEHR - 
SVPGRS PRALWRPPRRRNTD- 

ACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCT 

1381 + + + + + + 1440 

TGGGGGCAGCGGACCACGTCGACGAGGCGGTCGTGTCGTCGGGGACCGTCCACATGCCGA 

TPVAWCSCSASTAA PGRCTA 
PPS PGAAAP PAQQPLAGVRL - 
PRRLVQLLRQHS S PWQVYGF- 

TCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCCAGGCCTCTGG^ 

1441 + --- + + + + + 1500 

AGCACGCCCGGACGGACGCGGCCGACCACGGGGGTCCGGAGACCCCGAGGTCCGTGTTGC 

FIG. 72C 
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scgpacagwcpqa'sgapgtt 
raglpapagaprplglqaqr 
vraclrrlvppglw, gs. rhne- 

aacgccgcttcctcaggaacaccaagaagttcatctccctgggg 

1501 + + + + + + 1560 

TTGCGGCGAAGGAGTCCTTGTGGTTCTTCAAGTAGAGGG^ 

NAASSG TPRSSSPWGSMPSS 
TPLPQEHQEVHLPGEACQAL 
RRFLRNTKKF I SLGKHAKLS- 

CGCTGCAGGAGCTGACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGGAGGAGCC 

1561 + + + + + + ■ 1620 

GCGACGTCCTCGACTGCACCTTCTACTCGCACGCCCTGACGCGAACCGACGCGTCCTCGG 

RCRS * RGR*ACGTALGCAGA 
AAGADVEDERAGLRLAAQEP- 
LQELTWKMSVRDCAWLRRS P - 

CAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGT^ 

162 i + + + "t -- + -- + 1680 

GTCCCCAACCGACACAAGGCCGGCGTCTCGTGGCAGACGCACTCCTCTAGGACCGGTTCA 

QGLAVFRPQSTVCVRRSWPS 
RGWLCSGRRAPSA*GDPGQV- 
GVGCVPAAEHRLREE I L A K F - 

TCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 

1 68 1 + + + + + + 1740 

AGGACGTGACCGACTACTCAC^CATGCAGCAGCTCGACGAGTCCAGAAAGAAAATACAGT 

SCTG**VCTSSS'CSGLSFMS 
PALADECVRRRAAQVFLLCH 
LHWLMSVYVVELLRS FFYVT-. 

CGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGT 

174! + + + + + + 1800 

GCCTCTGGTGCAAAGTTTTCTTGTCCGAGAAAAAGAT 

RR'PRFKRTGSFSTG.RVSGAS 
GDHVSKEQALFLPEECLEQV- 
ETTFQKNRLFFYRKSVWSKL- 

TGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCT 
1801 + + + + + + 1860 

acgtttcgtaagcttagtctgtcgtgaacttctcccacgtcgacgccctcgacagccttc 

ckalesdst*'rgcscgscrk 
akhwnqtaleegaaagavgs - 
qs ig i rqhlkrvqlrelsea- 

cagaggtcaggcagcatcgggaagccaggcccgccctgctgacgtccagactccgcttca 

1861 + + + + + + 1920 

GTCTCCAGTCCGTCGTAGCCCTTCGGTCCGGGCGGGACGACTGCAGGTCTGAGGCGAAGT 

QRSG,SIGKPGPPC*RPDSAS 
RGQAASGSQARPADVQTPLH - 
EVRQHREARPALLTS RLRFI- 

TC CCCAAGC CTGACGGGCTGCGGC CGATTGTGAACATGGACTACGTCGTGGGAGC CAGAA 

1921 + + + + + + 1980 

AGGGGTTCGGACTGCCCGACGCCGGCTAACACTTGTACCTGATGCAGCACCCTCGGTCTT 
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SPSLTGCGRL*TW*TTSWEPE 
p Q A * RAAADCEHGLRRGSQ.N - 
PKPDGLRPIVNMDY. VVGART- 

CGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCG 

_ + + + + + 2C 

GCAAGGCGTCTCTTTTCTCCCGGCTCGCAGAGTGGAGCTCCCACT^ 



RSAEKRGPSV*SPRG*RHCSA 
VPQRKE GRASHLEGEGTVQR 
FRREKRAERLTSRVKALFSV- 



TGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCT^ 

+ + + + + + 

ACGAGTTGATGCTCGCCCGCGCCGCGGGGCCGGAGGACCCGCGGAGACACGACCCGGACC 



CSTTSGRGAPASWAPLCWAW 
AQLRAGAAPRPPGRLCAGPG - 
LNYERARRPGLLGASVLGLD- 



ACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGC 

+ + + + + + 

TGCTATAGGTGTCCCGGACCGCGTGGAAGCACGACGCACACGCCCGGGTCCTGGGCGGCG 



TISTGPGAPSCCVCGPRTRR 
RYPQGLAHLRAACAGPGPAA- 
DIHRAWRTFVLRVRAQDPPP- 



CTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACA 

, + + + + + 

GACTCGACATGAAACAGTTCCACCTACACTGCCCGCGCATGCTGTGGTAGGGGGTCCTGT 



LSCTLSRWM*RARTTPSPRT 
*AVLCQGGCDGRVRHHP PGQ 
ELYFVKVDVTGAYDT I PQDR 



GGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGT 

+ h I H + 

CCGAGTGCCTCCAGTAGCGGTCGTAGTAGTITGGGGTCTTGTGCATGACGCACGCAGCCA 



G SRR S S PAS SNPRT.RTACVG 
AHGGH RQHHQT PEHVLRASV 
LTEVIAS I I KPQNTY CVRRY- 



ATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCT 

+ + y H + 

TACGGCACCAGGTCTTCCGGCGGGTACCCGTGCAGGCGTTCCGGAAGTTCTCGGTGCAGA 



MPWSRRPPMGTSARPSRATS 
CRGPEGRPWARPQGLQE PRL 
AVVQ KAAHGHVR KAF KS HVS 

CTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGANAACA 

+ + + + + 

GATGGAACTGTCTGGAGGTCGGCATGTACGCTGTCAAGCACCGAGTGGACGTCCTNTTGT 



LP*QTSSRTCDSSWLTCR?T 
YLDRPPAVHATVRGS P A G ? Q - 
TLTDLQPYMRQFVAHLQ ? N S - 



GCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTG 

+ + + + + 

CGGGCGACTCCCTACGGCAGCAGTAGCTCGTCTCGAGGAGGGACTTACTCCGGTCGTCAC 
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AR*GMPSSSSRAP'P*MRPAV 
PAEGCRRHRAELLPE * GQQW- 
PLRDAVVIEQSSSL.NE. ASSG- 



GCCTCTTCGACGTCTTCCTACGCTTCATGTGC 

+ + + + + + 

CGGAGAAGCTGCAGAAGGATGCGAAGTACACGGTGGTGCGGCACGCGTAGTCCCCGTTCA 



ASSTSSYASCATTPCASGAS 
PLRRLP.TLHVPPRRAHQGQV- 
LFDVFLRFMCHHAVRIRGKS- 

CCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGC^ 

2521 + + + + + + 2580 

GGATGCAGGTCACGGTCCCCTAGGGCGTCCCGAGGTAGGAGAGGTGCGACGAGACGTCGG 

PTSSARGSRRA PSSPRCSAA 
LRPVPGDPAGLHPLHAALQP- 
YVQCQGI PQGS ILSTLLCSL- 

TGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCC 

2581 + + + + + + 2640 

ACACGATGCCGCTGTACCTCTTGTTCGACAAACGCCCCTAAGCCGCCCTGCCCGACGAGG 

CATATWRTSCLRGFGGTGCS 
VLRRHGEQAVCGDSAGRAAP- 
CYGDMENKLFAGI RRDGLLL- 



TGCGTTTGGTGGATGATTTCTTGTTGGT^ 

+ + + + + + 

ACGCAAACCACCTACTAAAGAACAACCACTGTGGAGTGGAGTGGGTGCGCIUU'1'GGAAGG 



CVWWMISCW*HLTSPTRKP"S 
AFGG* FLVGDTSPHPRENLP- 
RLVDDFLLVTPHLTHAKTFL- 



TCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAG 

, H + + + + 

AGTCCTGGGACCAGGCTCCACAGGGACTCATACC^^ 



SGPWSEVSLSMAAW.*TCGRQ 
QDPGPRCP*VWLRGELAEDS 
RTLVRGV PEYGCVVNLRKTV 



TGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGG 

+ H + + + + 

ACCACTTGAAGGGACATCTTCTGCTCCGGGACCCACCGTGCCGAAAACAAGTCTACGGCC 



W*TSL* KTRP*WVARLLFRCR 
GELPCRRRGPGWHGFCSDAG- 
VNFPVEDEALGGTAFVQMPA- 



CCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGA 

+ + + + + + 

GGGTGC CGG AT AAGGGGAC CACGC CGGACGACGAC CTATGGGC CTGGGAC CT CCACGTCT 



PTAYSPGAACCWIPGPWRCR 
PRPIPLVRPAAGYPDPGGAE - 
HGLFPWCGLLLDTRTLEVQS- 

GCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCT^ 

2881 + + + + + + 2940 

CGCTGATGAGGTCGATACGGGCCTGGAGGTAGTCTCGGTCAGAGTGGAAGTTGGCGCCGA 
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a ATTPAMPGPPSEP'VSPSTAA 

b RLL QLCPDLHQSQSHLQPRL- 

c DYSSYARTS IRASL,TFNRGF- 



TCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGG^ 

+ + + h + + 

AGTTCCGACCCTCCTTGTACGCAGCGTTTGAGAAACCCCAGAACGCCGACT^ 



a SRLGGTCVANSLGSCG* SVT 

b QGWEEHASQTLWGLAAEVSQ 
C KAGRNMRRKLFGVLRLKCHS 



GCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCAC 

, + -| + + + 

CGGACAAAGACCTAAACGTCCACTTGTCGGAGGTCTGCCACACGTGGTTGTAGATGTTCT 



a ACFWICR*TASRRCAPTSTR 

b PVSGFAGEQPPDGVHQHLQD 

c LFLDLQVNSLQTVCTN I Y K I - 



TCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCC^ 

+ + + "+ H + 

AGGAGGACGACGTCCGCATGTCCAAAGTGCGTACACACGACGTCGAGGGTAAAGTAGTCG 



a SSCCRRTGFTHVCCSSHFIS 

b ppAAGVQVSRMCAAAPISSA- 

C LLLQAYRFHACVLQLPFHQQ- 



AAGTTTGGAAG^CCCC^CATTTTTCCTGCGCGTC^TCTCTGACACGGCCTCCCTCTGCT 

+ + + y + + 

ttcaaaccttcttggggtgtaaaaaggacgcgcag™ 



a KFGRTPHFSCASSLTRPPSA 

b SLEEPHIFPARHL*HGLPLL- 

c VWKNPTFFLRVI SDTASLCY- 



ACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCC 

, + + y y + 

TGAGGTAGGACTTTCGGTTCTTGCGTC^ 



a T ps*KPRTQGCRWG.PRAPPA 
b LHPESQERRDVAGGQGRRRP 
C S I LKAKNAGMS LGAKGAAGP 



CTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGCTGACTC 

+ + + + + + 

GAGACGGGAGGCTCCGGCACGTCACCGACACGGTGGTTCGTAAGGACGAGTTCGACTGAG 



a L CPPRPCSGCATKHSCSS*L 
b SALRGRAVAVPPS IPAQADS 

C LPSEAVQWLCHQAFLLKLTR 



GACACCGTGTCAC CTACGTGC CACTC CTGGGGTCACTCAGGACAGCCCAGACGCAGCTGA 

+ + + + + v 

CTGTGGCACAGTGGATGCACGGTGAGGACCCCAGTGAGTCCTGTCGGGTCTGCGTCGACT 



a DTVSPTCHSWGHSGQPRRS * 

b TPCHLRATPGVTQDS PDAAE 

C HRVTYVPLLGS LRTAQTQLS- 



GTCGGAAGCTCCCGGGGACGACGCTGACTGCCCnX^GGCCGCAGCCAACCCGGCACTGC 

+ + + h + + 

CAGCCTTCGAGGGCCCCTGCTGCGACTGACGGGACCTCCGGCGTCGGTTGGGCCGTGACG 

FIG. 72G 
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b SEAPGDDADCPGGRSQPGTA- 
c RKLPGTTLTALEAA, ANPALP- 

CCTCAGACTTCAAGACCATCCTGGACTGATGGCC^ 

3421 + + + + + + 3480 

GGAGTCTGAAGTTCTGGTAGGACCTGACTACCGGTGGGCGGGTGTCGGTCCGGCTCTCGT 

a PQTSRPSWTDGHPPTARPRA 

b LRLQDH.PGLMATRPQPGREQ- 

c SDFKT I L D * WPPAHSQAESR- 

GACACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACAC 

3481 + + + + + + 3540 

CTGTGGTCGTCGGGACAGTGCGGCCCGAGATGCAGGGTCCCTCCCTCCCCGCCGGGTGTG 

a DTSSPVTPGSTSQGGRGGPH 

b TPAALSRRALRPREGGAAHT- 

c HQQ PCHAGLYVPGREGR PTP- 

CCAGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCT 

3541 + + + + -" + + 3600 

GGTCCGGGCGTGGCGACCCTCAGACTCCGGACTCACTCACAAACCGGCTCCGGACGTACA 

a PGPHRWESEA*VSVWPRPAC 

b QARTAGSLRPE*VFGRGLHV- 

c RPAPLGV*GLSECLiAEACMS- 

CCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCC^GGGCTGAGTG 

360! + + + + + + 3660 

GGCCGACTTCCGACTCACAGGCCGACTCCGGACTCGCTCACAGGTCGGTTCCCGACT 

a PAEG*VSG*GLSECPAKG*V 

b RLKAECPAEA*ASVQPRAEC 

c G*RLSVRLRPERVSSQGLSV-.. 

TCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCC 

3661 + + + + + + 3720 

AGGTCGTGTGGACGGCAGAAGTGAAGGGGTGTCCGACCGCGAGCCGAGGTGGGGTCCCGG 

a S STPAVFTS PQAGA RLHPRA 

b PAHLPSSLPHRLALGSTPGP 

c QHTCRLHFPTGWRSAPPQGQ- 

AGCTTTTCCTCACCAGGAGCCCGGCTTCCACTC 

3721 + + + - + + + 3780 

TCGAAAAGGAGTGGTCCTCGGGCCGAAGGTGAGGGGTGTATCCTTATCAGGTAGGGGTCT 

a SFSSPGARLP'LPT*E*SIPR 

b AFPHQEPGFHSPHRNSPSPD- 

c LFLTRS PASTPHIGIVHPQI- 

TTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACC^ 

3781 + + + + + + 3840 

AAGCGGTAACAAGTGGGGAGCGGGACGGGAGGAAACGGAAGGTGGGGGTGGTAGGTCCAC 

a FAIVHPSPCPPLPSTPTIQV 

b SPLFTPRPALLCLPPPPSRW- 

c RHCS PLALPS FAFHPHHPGG- 

GAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGG 

3841 + + + + + + 3900 

CTCTGGGACTCTTCCTGGGACCCTCGAGACCCTTAAACCTCACTGGTTTCCACACGGGAC 
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a ETLRRTLGALGIWSDQRCAL 

b RP*EGPWELWEFGVTKGVPC - 

c DPEKD PGS SGNL. E * PKVC P V - . 

TACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGT 

3901 + + + + + + 3960 

ATGTGTCCGCTCCTGGGACGTGGACCTACCCCCAGGGACACCCAGTTTAACCCCCCTCCA 

a - YTGEDPAPGWGSLWVKLGGG 

b TQARTLHLDGGPCGSNWGEV - 

C HRRG P C TWMGVPV GQ I GGRC- 

GCTGTGGGAGTAAAATACTGAATATATGAGTTT^ 

3961 + + + + + + 4020 

CGACACCCTCATTTTATGACTTATATACT 

a AVGVKY* IYEFFSFEKKKKK 

b LWE*NTEYMS FSVLKKKKKK- 

C CGSKI LNI * V F Q F * KKKKKK- 

AAAAAAAAA 

4021 4029 

TTTTTTTTT 

a K K K 

b K K - 

C K K 
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met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 n 

his tyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 



phe arg ala leu val ala gin cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg pro pro pro ala ala pro ser phe arg gin val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 
leu lys glu leu val ala arg val leu gin arg leu cys glu arg 
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 



100 



gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 



110 



120 



ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 

130 

tyr leu pro asn thr val thr asp ala .leu arg gly ser gly ala 
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 

140 150 
trp gly leu leu leu arg arg val gly asp asp val leu val his 
TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC 

160 

leu leu ala arg cys ala leu phe val leu val ala pro ser cys 
CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC 

170 180 
ala tyr gin val cys gly pro pro leu tyr gin leu gly ala ala 
GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC 

190 

thr gin ala arg pro pro pro his ala ser gly pro arg arg arg 
ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT 
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200 210 
leu gly cys glu arg ala trp asn his ser val arg glu ala gly 
CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG GCC GGG 

220 

val pro leu gly leu pro ala pro gly ala arg arg arg gly gly 
GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG GGC 

230 240 
ser ala ser arg ser leu pro leu pro lys arg pro arg arg gly 
AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC 

250 

ala ala pro glu pro glu arg thr pro val gly gin gly ser trp 
GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG 

260 270 
ala his pro gly arg thr arg gly pro ser asp arg gly phe cys 
GCC CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT 

280 

val val ser pro ala arg pro ala glu glu ala thr ser leu glu 

GTG GTG TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG 

290 300 

gly ala leu ser gly thr arg his ser his pro ser val gly arg 

GGT GCG CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC 

310 

gin his his ala gly pro pro ser thr ser arg pro pro arg pro 
CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC 

320 330 
trp asp thr pro cys pro pro val tyr ala glu thr lys his phe 
TGG GAC ACG CCT TGT CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC 

340 

leu tyr ser ser gly asp lys glu gin leu arg pro ser phe leu 
CTC TAC TCC TCA GGC GAC AAG GAG CAG CTG CGG CCC TCC TTC CTA 

350 360 
leu ser ser leu arg pro ser leu thr gly ala arg arg leu val 
CTC AGC TCT CTG AGG CCC AGC CTG ACT GGC GCT CGG AGG CTC GTG 

370 

glu thr ile phe leu gly ser arg pro trp met pro gly thr pro 
GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC 

380 390 
arg arg leu pro arg leu pro gin arg tyr trp gin met arg pro 
CGC AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC 

400 

leu phe leu glu leu leu gly asn his ala gin cys pro tyr gly 
CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG CAG TGC CCC TAC GGG 

410 420 
val leu leu lys thr his cys pro leu arg ala ala val thr pro 
GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA 
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ala ala gly 
GCA GCC GGT 



ala pro glu 
GCC CCC GAG 



leu arg gin 
CTC CGC CAG 



ala cys leu 
GCC TGC CTG 



his asn glu 
CAC AAC GAA 



leu gly lys 
CTG GGG AAG 



met ser val 
ATG AGC GTG 



gly cys val 
GGC TGT GTT 



ala lys phe 
GCC AAG TTC 



leu arg ser 
CTC AGG TCT 



arg leu phe 
AGG CTC TTT 



ile gly ile 
ATT GGA ATC 

ser glu ala 
TCG GAA GCA 



leu thr ser 
CTG ACG TCC 



pro ile val 
CCG ATT GTG 



val cys ala 
GTC TGT GCC 

440 

glu glu asp 
GAG GAG GAC 



his ser ser 
CAC AGC AGC 

470 

arg arg leu 
CGC CGG CTG 



arg arg phe 
CGC CGC TTC 

500 

his ala lys 
CAT GCC AAG 



arg asp cys 
CGG GAC TGC 

530 

pro ala ala 
CCG GCC GCA 



leu his trp 
CTG CAC TGG 

560 

phe phe tyr 
TTC TTT TAT 



phe tyr arg 
TTC TAC CGG 

590 

arg gin his 
AGA CAG CAC 

glu val arg 
GAG GTC AGG 

620 

arg leu arg 
AGA CTC CGC 



asn met asp 
AAC ATG GAC 



430 
arg glu lys 
CGG GAG AAG 



thr asp pro 
ACA GAC CCC 

460 

pro trp gin 
CCC TGG CAG 



val pro pro 
GTG CCC CCA 

490 

leu arg asn 
CTC AGG AAC 



leu ser leu 
CTC TCG CTG 

520 

ala trp leu 
GCT TGG CTG 



glu his arg 
GAG CAC CGT 

550 

leu met ser 
CTG ATG AGT 



val thr glu 
GTC ACG GAG 

580 

pro ser val 
CCG AGT GTC 



leu lys arg 
TTG AAG AGG 
610 

gin his arg 
CAG CAT CGG 



phe ile pro 
TTC ATC CCC 

640 

tyr val val 
TAC GTC GTG 



pro gin gly 
CCC CAG GGC 



arg arg leu 
CGT CGC CTG 



val tyr gly 
GTG TAC GGC 



gly leu trp 
GGC CTC TGG 



thr lys lys 
ACC AAG AAG 



gin glu leu 
CAG GAG CTG 



arg arg ser 
CGC AGG AGC 



leu arg glu 
CTG CGT GAG 



val tyr val 
GTG TAC GTC 



thr. thr phe 
ACC ACG TTT 



trp ser lys 
TGG AGC AAG 



val gin leu 
GTG CAG CTG 

glu ala arg 
GAA. GCC AGG 



lys pro asp 
AAG CCT GAC 



gly ala arg 
GGA GCC AGA 



ser val ala 
TCT GTG GCG 

450 
val gin leu 
GTG CAG CTG 



phe val arg 
TTC GTG CGG 

480 
gly ser arg 
GGC TCC AGG 



phe ile ser 
TTC ATC TCC 

510 
thr trp lys 
ACG TGG AAG 



pro gly val 

CCA GGG GTT 

540 
glu ile leu 
GAG ATC CTG 



val glu leu 
GTC GAG CTG 

570 

gin lys asn 
CAA AAG AAC 



leu gin ser 
TTG CAA AGC 

600 
arg glu leu 
CGG GAG CTG 

pro ala leu 
CCC GCC CTG 

630 
gly leu arg 
GGG CTG CGG 



thr phe arg 
ACG TTC CGC 
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650 660 
arg glu lys arg ala glu arg leu thr ser , arg val lys ala leu 
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 

670 

phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu 
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG 

680 690 
gly ala ser val leu gly leu asp asp ile his arg ala trp arg 
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 



thr phe val leu arg val arg 
ACC TTC GTG CTG CGT GTG CGG 

710 

tyr phe val lys val asp val 
TAC TTT GTC AAG GTG GAT GTG 



gin asp arg leu thr glu val 
CAG GAC AGG CTC ACG GAG GTC 

740 

asn thr tyr cys val arg arg 
AAC ACG TAC TGC GTG CGT CGG 



his gly his val arg lys ala 
CAT GGG CAC GTC CGC AAG GCC 

770 

thr asp leu gin pro tyr met 
ACA GAC CTC CAG CCG TAC ATG 



glu thr ser pro leu arg asp 
GAG ACC AGC CCG CTG AGG GAT 



700 

ala gin asp pro pro pro glu leu 
GCC CAG GAC CCG CCG CCT GAG CTG 

720 

thr gly ala tyr asp thr ile pro 
ACG GGC GCG TAC GAC ACC ATC CCC 

730 

ile ala ser ile ile lys pro gin 
ATC GCC AGC ATC ATC AAA CCC CAG 

750 

tyr ala val val gin lys ala ala 
TAT GCC GTG GTC CAG AAG GCC GCC 

760 

phe lys ser his val ser thr leu 
TTC AAG AGC CAC GTC TCT ACC TTG 

780 

arg gin phe val ala his leu gin 
CGA CAG TTC GTG GCT CAC CTG CAG 

790 

ala val val. ile glu gin ser ser 
GCC GTC GTC ATC GAG CAG AGC TCC 



800 

ser leu asn glu ala ser ser 
TCC CTG AAT GAG GCC AGC AGT 



phe met cys his his ala val 
TTC ATG TGC CAC CAC GCC GTG 



810 

gly le\i phe asp val phe leu arg 
GGC CTC TTC GAC GTC TTC CTA CGC 

820 

arg ile arg gly lys ser tyr val 
CGC ATC AGG GGC AAG TCC TAC GTC 



830 840 
gin cys gin gly ile pro gin gly ser ile leu ser thr leu leu 
CAG TGC CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC 

850 

cys ser leu cys tyr gly asp met glu asn lys leu phe ala gly 
TGC AGC CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG 

860 870 
ile arg arg asp gly leu leu leu arg leu val asp asp phe leu 
ATT CGG CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG 



FIG. 74D 



$ 



102/103 

880 

leu val thr pro his leu thr his ala lys thr phe leu arg thr 
TTG GTG ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC 

890 900 
leu val arg gly val pro glu tyr gly cys val val asn leu arg 
CTG GTC CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG 

910 

lys thr val val asn phe pro val glu asp glu ala leu gly gly 
AAG ACA GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC 

920 930 
thr ala phe val gin met pro ala his gly leu phe pro trp cys 
ACG GCT TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC 

940 

gly leu leu leu asp thr arg thr leu glu val gin ser asp tyr 
GGC CTG CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC 

950 - - • 960 

ser ser tyr ala arg thr ser ile arg ala ser val thr phe asn 
TCC AGC. TAT GCC CGG ACC TCC ATC AGA GCC AGT GTC ACC TTC AAC 

970 

arg gly phe lys ala gly arg asn met arg arg lys leu phe gly 
CGC GGC TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG 

980 990 
val leu arg leu lys cys his ser leu phe leu asp leu gin val 
GTC TTG CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG 

1000 

asn ser leu gin thr val cys thr asn ile tyr lys ile leu leu 
AAC AGC CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG 

1010 1020 
leu gin ala tyr arg phe his ala cys val leu gin leu pro phe 
CTG CAG GCG TAC AGG TTT CAC GCA TGT GTG- CTG CAG CTC CCA TTT 

1030 

his gin gin val trp lys asn pro thr phe phe leu arg val ile 
CAT CAG CAA GTT TGG AAG AAC CCC ACA TTT TTC CTG CGC GTC ATC 

1040 1050 
ser asp thr ala ser leu cys tyr ser ile leu lys ala lys asn 
TCT GAC ACG GCC TCC CTC TGC TAC TCC ATC CTG AAA GCC AAG AAC 

1060 

ala gly met ser leu gly ala lys gly ala ala gly pro leu pro 
GCA GGG ATG TCG CTG GGG GCC AAG GGC GCC GCC GGC CCT CTG CCC 

1070 1080 
ser glu ala val gin trp leu cys his gin ala phe leu leu lys 
TCC GAG GCC GTG CAG TGG CTG TGC CAC CAA GCA TTC CTG CTC AAG 

1090 

leu thr arg his arg val thr tyr val pro leu leu gly ser leu 
CTG ACT CGA CAC CGT GTC ACC TAC GTG CCA CTC CTG GGG TCA CTC 
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1100 1110 

arg thrala gin thr gin leu ser arg lys leu pro gly thr thr 

AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG 

1120 

leu thr ala leu glu ala ala ala asn pro ala leu pro ser asp 

CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC 

1130 1132 
phe lys thr ile leu asp OP 

TTC AAG ACC ATC CTG GAC TGA TGGCC AC C CG C C C ACAGC C AGGC CGAGAG CAGA 



CACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC 
AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 
GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC 
CAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 
CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 
CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 
GACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA 
CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGC 
TGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGRAAAAAAAAAAAAAAAAAA 



AAAAAAAAAA 
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